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(57) Abstract 

Hie invention describes a method for isolating one or 
more genetic elements encoding a gene product having a de- 
sired activity, comprising the steps of: (a) compartmentalis- 
ing genetic elements into microcapsules; (b) expressing the 
genetic elements to produce their respective gene products 
within the microcapsules; (c) sorting the genetic elements 
which produce the gene product having a desired activity. 
The invention enables the in vitro evolution of nucleic acids 
by repeated mutagenesis and iterative applications of the 
method of the invention. 
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IN VITRO SORTING METHOD 

The present invention relates to methods for use in in vitro evolution of molecular libraries. 
In particular, the present invention relates to methods of selecting nucleic acids encoding gene 
products in which the nucleic acid and the activity of the encoded gene product are linked by 
compartmentation. 

Evolution requires the generation of genetic diversity (diversity in nucleic acid) followed by 
the selection of those nucleic acids which result in beneficial characteristics. Because the 
nucleic acid and the activity of the encoded gene product of an organism are physically linked 
(the nucleic acids being confined within the cells which they encode) multiple rounds of 
mutation and selection can result in the progressive survival of organisms with increasing 
fitness. Systems for rapid evolution of nucleic acids or proteins in vitro must mimic this 
process at the molecular level in that the nucleic acid and the activity of the encoded gene 
product must be linked and the activity of the gene product must be selectable. 

Recent advances in molecular biology have allowed some molecules to be co-selected 
according to their properties along with the nucleic acids that encode them. The selected 
nucleic acids can subsequently be cloned for further analysis or use, or subjected to additional 
rounds of mutation and selection. 

Common to these methods is the establishment of large libraries of nucleic acids. Molecules 
having the desired characteristics (activity) can be isolated through selection regimes that 
select for the desired activity of the encoded gene product, such as a desired biochemical or 
biological activity, for example binding activity. 

Phage display technology has been highly successful as providing a vehicle that allows for the 
selection of a displayed protein by providing the essential link between nucleic acid and the 
activity of the encoded gene product (Smith, 1985; Bass et at. , 1990; McCafferty et at., 
1990; for review see Clackson and Wells, 1994). Filamentous phage particles act as genetic 
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display packages with proteins on the outside and the genetic elements which encode them on 
the inside. The tight linkage between nucleic acid and the activity of the encoded gene 
product is a result of the assembly of the phage within bacteria. As individual bacteria are 
rarely multiply infected, in most cases all the phage produced from an individual bacterium 
will carry the same genetic element and display the same protein. 

However, phage display relies upon the creation of nucleic acid libraries in vivo in bacteria. 
Thus, the practical limitation on library size allowed by phage display technology is of the 
order of 10 7 to 10 u , even taking advantage of X phage vectors with excisable filamentous 
phage replicons. The technique has mainly been applied to selection of molecules with 
binding activity. A small number of proteins with catalytic activity have also been isolated 
using this technique, however, in no case was selection directly for the desired catalytic 
activity, but either for binding to a transition-state analogue (Widersten and Mannervik, 
1995) or reaction with a suicide inhibitor (Soumillion et al., 1994; Janda et al., 1997). 

Specific peptide ligands have been selected for binding to receptors by affinity selection using 
large libraries of peptides linked to the C terminus of the lac repressor Lacl (Cull et al., 
1992). When expressed in E. coli the repressor protein physically links the ligand to the 
encoding plasmid by binding to a lac operator sequence on the plasmid. 

An entirely in vitro polysome display system has also been reported (Mattheakis et al., 1994) 
in which nascent peptides are physically attached via the ribosome to the RNA which encodes 
them. 

However, the scope of the above systems is limited to the selection of proteins and 
furthermore does not allow direct selection for activities other than binding, for example 
catalytic or regulatory activity. 



In vitro RNA selection and evolution (Ellington and Szostak, 1990), sometimes referred to as 
SELEX (systematic evolution of ligands by exponential enrichment) (Tuerk and Gold, 1990) 
allows for selection for both binding and chemical activity, but only for nucleic acids. When 
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selection is for binding, a pool of nucleic acids is incubated with immobilised substrate. 
Non-binders are washed away, then the binders are released, amplified and the whole process 
is repeated in iterative steps to enrich for better binding sequences. This method can also be 
adapted to allow isolation of catalytic RNA and DNA (Green and Szostak. 1992; for reviews 
see Chapman and Szostak, 1994; Joyce, 1994; Gold etal, 1995; Moore, 1995). 

However, selection for "catalytic" or binding activity using SELEX is only possible because 
the same molecule performs the dual role of carrying the genetic information and being the 
catalyst or binding molecule (aptamer). When selection is for "auto-catalysis" the same 
molecule must also perform the third role of being a substrate. Since the genetic element 
must play the role of both the substrate and the catalyst, selection is only possible for single 
turnover events. Because the "catalyst" is in this process itself modified, it is by definition 
not a true catalyst. Additionally, proteins may not be selected using the SELEX procedure. 
The range of catalysts, substrates and reactions which can be selected is therefore severely 
limited. 



Those of the above methods that allow for iterative rounds of mutation and selection are 
mimicking in vitro mechanisms usually ascribed to the process of evolution: iterative 
variation, progressive selection for a desired the activity and replication. However, none of 
the methods so far developed have provided molecules of comparable diversity and functional 
efficacy to those that are found naturally. Additionally, there are no man-made "evolution" 
systems which can evolve both nucleic acids and proteins to effect the full range of 
biochemical and biological activities (for example, binding, catalytic and regulatory activities) 
and that can combine several processes leading to a desired product or activity. 

There is thus a great need for an in vitro system that overcomes the limitations discussed 
above. 



WO 99/02671 

4 

BRIEF DESCRIPTION OF THE INVENTION 



PCT/GB98/01889 



According to a first aspect of the present invention, there is provided a method for isolating 
one or more genetic elements encoding a gene product having a desired activity, 
comprising the steps of: 

(a) compartmentalising genetic elements into microcapsules; 

(b) expressing the genetic elements to produce their respective gene products within 
the microcapsules; 

(c) sorting the genetic elements which produce the gene product(s) having the 
desired activity. 

The microcapsules according to the present invention compartmentalise genetic elements and 
gene products such that they remain physically linked together. Surprisingly, nucleic acid 
expression remains possible within the artificial microcapsules allowing for isolation of 
nucleic acid on the basis if the activity of the gene product which it encodes. 

As used herein, a genetic element is a molecule or molecular construct comprising a nucleic 
acid. The genetic elements of the present invention may comprise any nucleic acid (for 
example, DNA, RNA or any analogue, natural or artificial, thereof). The nucleic acid 
component of the genetic element may moreover be linked, covalently or non-covalently, to 
one or more molecules or structures, including proteins, chemical entities and groups, 
solid-phase supports such as magnetic beads, and the like. In the method of the invention, 
these structures or molecules can be designed to assist in the sorting and/or isolation of the 
genetic element encoding a gene product with the desired activity. 

Expression, as used herein, is used in its broadest meaning, to signify that a nucleic acid 
contained in the genetic element is converted into its gene product. Thus, where the nucleic 
acid is DNA, expression refers to the transcription of the DNA into RNA; where this RNA 
codes for protein, expression may also refer to the translation of the RNA into protein. 
Where the nucleic acid is RNA, expression may refer to the replication of this RNA into 
further RNA copies, the reverse transcription of the RNA into DNA and optionally the 
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transcription of this DNA into further RNA moIecule( S ), as well as optionally the translation 
of any of the RNA species produced into protein. Preferably, therefore, expression is 
perfonned by one or more processes selected from the group consisting of transcription, 
reverse transcription, replication and translation. 

Expression of the genetic element may thus be directed into either DNA, RNA or protein, or 
a nucleic acid or protein staining unnatural bases or amino acids (the gene product) wkhin 
the microcapsule of the invention, so that the gene product is confined within the same 
microcapsule as the genetic element. 

The genetic element and the gene product thereby encoded are linked by confining each 
genetic element and the respective gene product encoded by the genetic element within the 
same microcapsule. In this way the gene product in one microcapsule cannot cause a change 
in any other microcapsules. 

The term "microcapsule" is used herein in accordance with the meaning normally assigned 
thereto in the art and further described hereinbelow. In essence, however, a microcapsule is 
an artificial compartment whose delimiting borders restrict the exchange of the components of 
the molecular mechanisms described herein which allow the sorting of genetic elements 
according to the function of the gene products which they encode. 

Preferably, the microcapsules used in the method of the present invention will be capable of 
being produced in very large numbers, and thereby to compartmentalise a library of genetic 
elements which encodes a repertoire of gene products. 

According to a preferred embodiment of the first aspect of the present invention, the sorting 
of genetic elements may be performed in one of essentially four techniques. 

0) In a first embodiment, the microcapsules are sorted according to an activity of the 
gene product or derivative thereof which makes the microcapsule detectable as a whole. 
Accordingly, the invention provides a method according to the first aspect of the invention 
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wherein a gene product with the desired activity induces a change in the microcapsule, or a 
modification of one or more molecules within the microcapsule, which enables the 
microcapsule containing the gene product and the genetic element encoding it to be sorted. 
In this embodiment, therefore, the microcapsules are physically sorted from each other 
5 according to the activity of the gene product(s) expressed from the genetic elements) 
contained therein, which makes it possible selectively to enrich for microcapsules 
containing gene products of the desired activity. 

(H) In a second embodiment, the genetic elements are sorted following pooling of the 
10 microcapsules into one or more common compartments. In this embodiment, a gene product 
having the desired activity modifies the genetic element which encoded it (and which resides 
in the same microcapsule) in such a way as to make it selectable in a subsequent step. The 
reactions are stopped and the microcapsules are then broken so that all the contents of the 
individual microcapsules are pooled. Selection for the modified genetic elements enables 

15 enrichment of the genetic elements encoding the gene produces) having the desired activity. 
Accordingly, the invention provides a method according to the first aspect of the invention, 
wherein in step (b) the gene product having the desired activity modifies the genetic 
element encoding it to enable the isolation of the genetic element. It is to be understood, of 
course, that modification may be direct, in that it is caused by the direct action of the gene 

20 product on the genetic element, or indirect, in which a series of reactions, one or more of 
which involve the gene product having the desired activity, leads to modification of the 
genetic element. 

(IH) In a third embodiment, the genetic elements are sorted following pooling of the 
25 microcapsules into one or more common compartments. In this embodiment, a gene with a 
desired activity induces a change in the microcapsule containing the gene product and the 
genetic element encoding it. This change, when detected, triggers the modification of the 
gene within the compartment. The reactions are stopped and the microcapsules are then 
broken so that all the contents of the individual microcapsules are pooled. Selection for the 
30 modified genetic elements enables enrichment of the genetic elements encoding the gene 
produces) having the desired activity. Accordingly the invention provides a method 
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according to the first aspect of the invention, where in step (b) the gene product having the 
desired activity induces a change in the compartment which is detected and triggers the 
modification of the genetic element within the compartment so as to allow its isolation. It is 
to be understood that the detected change in the compartment may be caused by the direct 
5 action of the gene product, or indirect action, in which a series of reactions, one or more of 
which involve the gene product having the desired activity leads to the detected change. 

(TV) In a fourth embodiment, the genetic elements may be sorted by a multi-step 
procedure, which involves at least two steps, for example, in order to allow the exposure of 
10 the genetic elements to conditions which permit at least two separate reactions to occur. As 
will be apparent to a persons skilled in the art, the first microencapsulation step of the 
invention must result in conditions which permit the expression of the genetic elements - be it 
transcription, transcription and/or translation, replication or the like. Under these conditions, 
it may not be possible to select for a particular gene product activity, for example because the 
15 gene product may not be active under these conditions, or because the expression system 
contains an interfering activity. The invention therefore provides a method according to the 
first aspect of the present invention, wherein step (b) comprises expressing the genetic 
elements to produce their respective gene products within the microcapsules, linking the 
gene products to the genetic elements encoding them and isolating the complexes thereby 
20 formed. This allows for the genetic elements and their associated gene products to be 

isolated from the capsules before sorting according to gene product activity takes place. In 

a preferred embodiment, the complexes are subjected to a further compartmentalisation step 

prior to isolating the genetic elements encoding a gene product having the desired activity. 

This further compartmentalisation step, which advantageously takes place in microcapsules, 
25 permits the performance of further reactions, under different conditions, in an environment 

where the genetic elements and their respective gene products are physically linked. 

Eventual sorting of genetic elements may be performed according to embodiment (I), (H) 

or (IE) above. 
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The "secondary encapsulation" may also be performed with genetic elements linked to 
gene products by other means, such as by phage display, polysome display, RNA-peptide 
fusion or lac repressor peptide fusion. 

The selected genetic elements) may also be subjected to subsequent, possibly more stringent 
rounds of sorting in iteratively repeated steps, reapplying the method of the invention either 
in its entirety or in selected steps only. By tailoring the conditions appropriately, genetic 
elements encoding gene products having a better optimised activity may be isolated after each 
round of selection. 

Additionally, the genetic elements isolated after a first round of sorting may be subjected to 
mutagenesis before repeating the sorting by iterative repetition of the steps of the method of 
the invention as set out above. After each round of mutagenesis, some genetic elements will 
have been modified in such a way that the activity of the gene products is enhanced. 

Moreover, the selected genetic elements can be cloned into an expression vector to allow 
further characterisation of the genetic elements and their products. 

In a second aspect, the invention provides a product when selected according to the first 
aspect of the invention. As used in this context, a "product" may refer to a gene product, 
selectable according to the invention, or the genetic element (or genetic information 
comprised therein). 

In a third aspect, the invention provides a method for preparing a gene product, comprising 
the steps of: 

(a) preparing a genetic element encoding the gene product; 

(b) compartmentalising genetic elements into microcapsules; 

(c) expressing the genetic elements to produce their respective gene products within 
the microcapsules; 

(d) sorting the genetic elements which produce the gene produces) having the 
desired activity; and 
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In accordance with the third aspect, step (a) preferably comprises preparing a repertoire of 
genetic elements, wherein each genetic element encodes a potentially differing gene 
product. Repertoires may be generated by conventional techniques, such as those 
employed for the generation of libraries intended for selection by methods such as phage 
display. Gene products having the desired activity may be selected from the repertoire, 
according to the present invention. 

In a fourth aspect, the invention provides a method for screening a compound or 
compounds capable of modulation the activity of a gene product, comprising the steps of: 

(a) preparing a repertoire of genetic element encoding gene product; 

(b) compartmentalising genetic elements into microcapsules; 

(c) expressing the genetic elements to produce their respective gene products within 
the microcapsules; 

(d) sorting the genetic elements which produce the gene product(s) having the 
desired activity; and 

(e) contacting a gene product having the desired activity with the compound or 
compounds and monitoring the modulation of an activity of the gene product by the 
compound or compounds. 

Advantageously, the method further comprises the step of: 

(f) identifying the compound or compounds capable of modulating the activity of the 
gene product and synthesising said compound or compounds. 



This selection system can be configured to select for RNA, DNA or protein molecules with 
catalytic, regulatory or binding activity. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 

Gene selection by compartmentalisation. 
5 a Schematic representation of the selection procedure. In Step /, an in vitro 
transcription/translation reaction mixture containing a library of genetic elements linked to a 
substrate for the reaction being selected is dispersed to form a water-in-oil emulsion with 
typically one genetic element per aqueous compartment. The genetic elements are transcribed 
and translated within their compartments (Step 2). Subsequently (Step J), proteins (or RNAs) 

10 with enzymatic activities convert the substrate into a product that remains linked to the 
genetic element. Compartmentalisation prevents the modification of genetic elements in other 
compartments. Next (Step 4), the emulsion is broken, all reactions are stopped and the 
aqueous compartments combined. Genetic elements which are linked to the product are 
selectively enriched, then amplified, and either characterised (Step 5), or linked to the 

15 substrate and compartmentalised for further rounds of selection (Step 6). 

b Selection for target-specific DNA methy lotion by HaelH methylase. The substrate is a 
segment of DNA containing Haelll restriction/modification (R/M) sites. Genetic elements are 
isolated by binding to streptavidin-coated magnetic beads and treated with the cognate 
restriction enzyme Haelll. Only nucleic acids with methylated R/M sites are resistant to 

20 cleavage and subsequently amplified by PCR. 



Figure 2a 

Droplet size distribution and activities ofDHFR and Haelll methylase in emulsions: size 
distribution of the aqueous compartments in an emulsion determined by laser diffraction. In 
25 vitro transcription/translation reaction mixtures containing DNA and sodium deoxycholate are 
emulsified by stirring, or by stirring followed by homogenisation at 8k, 9.5k or 13.5k rpm. 
The size distribution of the aqueous particles is shown by percentage of the total aqueous 
volume. 
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Figure 2b 

772* activity of DHFR formed in situ by transcription and translation of its gene (Fig. lb) in 
aqueous compartments of an emulsion. The concentration of the folA gene used (2.5 nM) 
gives an average of one gene per droplet in the finest emulsions (homogenised at 13.5 k 
5 rpm). The mean diameter calculated from the size distribution data (in Figure 2) is 
presented as a function of the speed of homogenisation (Ok rpm refers to the emulsion 
prepared by stirring with no further homogenisation). Activity is presented as percentage 
of the activity observed in the non-emulsified in vitro reaction mixture under the same 
conditions. 

10 

The activity ofHaelH methylase formed in situ by transcription and translation of its gene 
(Fig. lb) in aqueous compartments of an emulsion. The concentration of the M.Haeffl 
gene used (2.5 nM) gives an average of one gene per droplet in the finest emulsions 
(homogenised at 13.5 k rpm). The mean diameter calculated from the size distribution data 
15 (in Figure 2a) is presented as a function of the speed of homogenisation; (Ok rpm refers to 
the emulsion prepared by stirring with no further homogenisation). Activity is presented as 
percentage of the activity observed in the non-emulsified in vitro reaction mixture under the 
same conditions. 



20 Figure 3 

Selections for HaeUl DNA methylase. 

a Selecting M.Haeffl genes from a 1000-fold excess offolA genes. Reactions were set 
up with 0.2 nM of DIG-/o/A-3s-Biotin DNA (corresponding to an average of one gene per 
compartment), spiked with 0.2 pM of DIG-M.//adII-3s-Biotin. Reaction mixtures were 

25 either emulsified by stirring or left in solution. The DNA from these reactions was captured, 
digested with Haelll (or with Hhal) and amplified by PCR. This DNA was further amplified 
by nested PCR with primers LMB2-Nest and LMB3-Nest and five microlitres of each nested 
PCR was electrophoresed on a 1.5% agarose gel containing ethidium bromide. Markers, 
<f>X174-/fadII digest; minus T7, no T7 RNA polymerase; minus NadCh, no sodium 

30 deoxycholate. 
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b Two-round selections. Reactions containing a 1:10 4 to 1:10 7 molar ratio of DIG- 
M./faeIII-3s-Biotin : DIG-/o/A-3s-Biotin (at 500 pM) are emulsified by stirring. The DNA 
from these reactions is digested with HaeTR and amplified by PCR with primers LMB2- 
Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10). The amplified DNA from the 
first round selection of 1:10 4 and 1:10 5 ratios (at 20 pM) and the 1:10 6 and 1:10 7 ratios (at 
500 pM) is put into a second round of selection. This DNA was further amplified by nested 
PCR with primers LMB2-Nest and LMB3-Nest and five microlitres of nested PCR from each 
round of selection are analysed by gel electrophoresis as above (upper panel). The same 
DNA was translated in vitro and the resulting methylase activity was measured. Results are 
presented as the percentage of substrate DNA methylated (lower panel). 



DETAILED DESCRIPTION OF THE INVENTION 



(A) GENERAL DESCRIPTION 

The microcapsules of the present invention require appropriate physical properties to allow 
the working of the invention. 



First, to ensure that the genetic elements and gene products may not diffuse between 
microcapsules, the contents of each microcapsule must be isolated from the contents of the 
surrounding microcapsules, so that there is no or little exchange of the genetic elements and 
gene products between the microcapsules over the timescale of the experiment. 

Second, the method of the present invention requires that there are only a limited number of 
genetic elements per microcapsule. This ensures that the gene product of an individual 
genetic element will be isolated from other genetic elements. Thus, coupling between genetic 
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element and gene product will be highly specific. The enrichment factor is greatest with on 
average one or fewer genetic elements per microcapsule, the linkage between nucleic acid and 
the activity of the encoded gene product being as tight as is possible, since the gene product 
of an individual genetic element will be isolated from the products of all other genetic 
5 elements. However, even if the theoretically optimal situation of, on average, a single 
genetic element or less per microcapsule is not used, a ratio of 5, 10, 50, 100 or 1000 or 
more genetic elements per microcapsule may prove beneficial in sorting a large library. 
Subsequent rounds of sorting, including renewed encapsulation with differing genetic element 
distribution, will peimit more stringent sorting of the genetic elements. Preferably, there is a 
10 single genetic element, or fewer, per microcapsule. 

Third, the formation and the composition of the microcapsules must not abolish the function 
of the machinery the expression of the genetic elements and the activity of the gene products. 

15 Consequently, any microencapsulation system used must fulfil these three requirements. The 
appropriate system(s) may vary depending on the precise nature of the requirements in each 
application of the invention, as will be apparent to the skilled person. 



20 



A wide variety of microencapsulation procedures are available (see Benita, 1996) and may 
be used to create the microcapsules used in accordance with the present invention. Indeed, 
more than 200 microencapsulation methods have been identified in the literature (Finch 
1993). 

These include membrane enveloped aqueous vesicles such as lipid vesicles (liposomes) 
25 (New, 1990) and non-ionic surfactant vesicles (van Hal et al., 1996). These are 
closed-membranous capsules of single or multiple bilayers of non-covalently assembled 
molecules, with each bilayer separated from its neighbour by an aqueous compartment. In 
the case of liposomes the membrane is composed of lipid molecules; these are usually 
phospholipids but sterols such as cholesterol may also be incorporated into the membranes 
(New, 1990). A variety of enzyme-catalysed biochemical reactions, including RNA and 
DNA polymerisation, can be performed within liposomes (Chakrabarti et al., 1994; 
Ooerholzer et al., 1995a; Oberholzer et al., 1995b; Walde et al., 1994- Wick & Luisi' 
1996). 



30 
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With a membrane-enveloped vesicle system much of the aqueous phase is outside the 
vesicles and is therefore non-compartmentalised. This continuous, aqueous phase should be 
removed or the biological systems in it inhibited or destroyed (for example, by digestion of 
nucleic acids with DNase or RNase) in order that the reactions are limited to the 
microcapsules (Luisi etal., 1987). 

Enzyme-catalysed biochemical reactions have also been demonstrated in microcapsules 
generated by a variety of other methods. Many enzymes are active in reverse micellar 
solutions (Bru & Walde, 1991; Bru & Walde, 1993; Creagh et al., 1993; Haber et al., 
1993; Kumar et al., 1989; Luisi & B., 1987; Mao & Walde, 1991; Mao et al., 1992; 
Perez et al., 1992; Walde et al., 1994; Walde et al., 1993; Walde et al, 1988) such as the 
AOT-isooctane-water system (Menger & Yamada, 1979). 

Microcapsules can also be generated by interfacial polymerisation and interfacial 
complexation (Whateley, 1996). Microcapsules of this sort can have rigid, nonpermeable 
membranes, or semipermeable membranes. Semipermeable microcapsules bordered by 
cellulose nitrate membranes, polyamide membranes and lipid-polyamide membranes can all 
support biochemical reactions, including multienzyme systems (Chang, 1987; Chang, 1992; 
Lim, 1984). Alginate/polylysine microcapsules (Lim & Sun, 1980), which can be formed 
under very mild conditions, have also proven to be very biocompatible, providing, for 
example, an effective method of encapsulating living cells and tissues (Chang, 1992; Sun et 
al., 1992). 

Non-membranous microencapsulation systems based on phase partitioning of an aqueous 
environment in a colloidal system, such as an emulsion, may also be used. 

Preferably, the microcapsules of the present invention are formed from emulsions; 
heterogeneous systems of two immiscible liquid phases with one of the phases dispersed in 
the other as droplets of microscopic or colloidal size (Becher, 1957; Sherman, 1968; Lissant, 
1974; Lissant, 1984). 

Emulsions may be produced from any suitable combination of immiscible liquids. Preferably 
the emulsion of the present invention has water (containing the biochemical components) as 
the phase present in the form of finely divided droplets (the disperse, internal or 
discontinuous phase) and a hydrophobic, immiscible liquid (an 'oil') as the matrix in which 
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these droplets are suspended (the nondisperse, continuous or external phase). Such emulsions 
are termed 'water-in-oil' (W/O). This has the advantage that the entire aqueous phase 
containing the biochemical components is compartmentalised in discreet droplets (the internal 
phase). The external phase, being a hydrophobic oil, generally contains none of the 
biochemical components and hence is inert. 

The emulsion may be stabilised by addition of one or more surface-active agents 
(surfactants). These surfactants are termed emulsifying agents and act at the water/oil 
interface to prevent (or at least delay) separation of the phases. Many oils and many 
emulsifiers can be used for the generation of water-in-oil emulsions; a recent compilation 
listed over 16,000 surfactants, many of which are used as emulsifying agents (Ash and Ash, 
1993). Suitable oils include light white mineral oil and non-ionic surfactants (Schick, 1966) 
such as sorbitan monooleate (Span™80; ICI) and polyoxyethylenesorbitan monooleate 
(Tween™ 80; ICI). 

The use of anionic surfactants may also be beneficial. Suitable surfactants include sodium 
cholate and sodium taurocholate. Particularly preferred is sodium deoxycholate, preferably at 
a concentration of 0.5% w/v, or below. Inclusion of such surfactants can in some cases 
increase the expression of the genetic elements and/or the activity of the gene products. 
Addition of some anionic surfactants to a non-emulsified reaction mixture completely 
abolishes translation. During emulsification, however, the surfactant is transferred from the 
aqueous phase into the interface and activity is restored. Addition of an anionic surfactant to 
the mixtures to be emulsified ensures that reactions proceed only after compartmentalisation. 

Creation of an emulsion generally requires the application of mechanical energy to force the 
phases together. There are a variety of ways of doing this which utilise a variety of 
mechanical devices, including stirrers (such as magnetic stir-bars, propeller and turbine 
stirrers, paddle devices and whisks), homogenisers (including rotor-stator homogenisers, 
high-pressure valve homogenisers and jet homogenisers), colloid mills, ultrasound and 
'membrane emulsification' devices (Becher, 1957; Dickinson, 1994). 
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Aqueous microcapsules formed in water-in-oil emulsions are generally stable with little if any 
exchange of genetic elements or gene products between microcapsules. Additionally, we 
have demonstrated that several biochemical reactions proceed in emulsion microcapsules. 
Moreover, complicated biochemical processes, notably gene transcription and translation are 
5 also active in emulsion microcapsules. The technology exists to create emulsions with 
volumes all the way up to industrial scales of thousands of litres (Becher, 1957; Sherman, 
1968; Lissant, 1974; Lissant, 1984). 

The preferred microcapsule size will vary depending upon the precise requirements of any 
10 individual selection process that is to be performed according to the present invention. In all 
cases, there will be an optimal balance between gene library size, the required enrichment 
and the required concentration of components in the individual microcapsules to achieve 
efficient expression and reactivity of the gene products. 



15 The processes of expression must occur within each individual microcapsule provided by the 
present invention. Both in vitro transcription and coupled transcription-translation become 
less efficient at sub-nanomolar DNA concentrations. Because of the requirement for only a 
limited number of DNA molecules to be present in each microcapsule, this therefore sets a 
practical upper limit on the possible microcapsule size. Preferably, the mean volume of the 

20 microcapsules is less that 5.2 x 10~ 16 m 3 , (corresponding to a spherical microcapsule of 
diameter less than lOum, more preferably less than 6.5 x 10" 17 m 3 (5um), more preferably 
about 4.2 x 10* 18 m 3 (2um) and ideally about 9 x 10" ,s m 3 (2.6pm). 

The effective DNA or RNA concentration in the microcapsules may be artificially increased 
25 by various methods that will be well-known to those versed in the art. These include, for 
example, the addition of volume excluding chemicals such as polyethylene glycols (PEG) and 
a variety of gene amplification techniques, including transcription using RNA polymerases 
including those from bacteria such as E. coU (Roberts, 1969; Blattner and Dahlberg, 1972; 
Roberts a al., 1975; Rosenberg et al. , 1975) , eukaryotes e. g. (Weil et al. , 1979; Manley 
30 et al. , 1983) and bacteriophage such as T7, T3 and SP6 (Melton et al. , 1984); the polymerase 
chain reaction (PCR) (Saiki et al., 1988); QP replicase amplification (Miele et al., 1983; 
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Cahill et al. y 1991; Chetverin and Spirin, 1995; Katanaev et al. f 1995); the ligase chain 
reaction (LCR) (Landegren et al. y 1988; Barany, 1991); and self-sustained sequence 
replication system (Fahy et al. y 1991) and strand displacement amplification (Walker et aL, 
1992). Even gene amplification techniques requiring thermal cycling such as PCR and LCR 
5 could be used if the emulsions and the in vitro transcription or coupled 
transcription-translation systems are thermostable (for example, the coupled 
transcription-translation systems could be made from a thermostable organism such as 
Thermus aquaticus). 

10 Increasing the effective local nucleic acid concentration enables larger microcapsules to be 
used effectively. This allows a preferred practical upper limit to the microcapsule volume of 
about 5.2 x 10~ 16 m 3 (corresponding to a sphere of diameter 10nm). 

The microcapsule size must be sufficiently large to accommodate all of the required 
15 components of the biochemical reactions that are needed to occur within the microcapsule. 
For example, in vitro, both transcription reactions and coupled transcription-translation 
reactions require a total nucleoside triphosphate concentration of about 2mM. 

For example, in order to transcribe a gene to a single short RNA molecule of 500 bases in 
20 length, this would require a minimum of 500 molecules of nucleoside triphosphate per 
microcapsule (8.33 x 10" 22 moles). In order to constitute a 2mM solution, this number of 
molecules must be contained within a microcapsule of volume 4.17 x 10" 19 litres (4.17 x-10 22 
m 3 which if spherical would have a diameter of 93nm. 

25 Furthermore, particularly in the case of reactions involving translation, it is to be noted that 
the ribosomes necessary for the translation to occur are themselves approximately 20nm in 
diameter. Hence, the preferred lower limit for microcapsules is a diameter of approximately 
0.1 urn (lOOnm). 

30 Therefore, the microcapsule volume is preferably of the order of between 5.2 x 10 22 m 3 
and 5.2 x 10* 16 m 3 corresponding to a sphere of diameter between O.l^m and 10^m, 
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more preferably of between about 5.2 x 10" m 3 and 6.5 x 10 17 m 3 (lum and 5um). Sphere 
diameters of about 2.6um are most advantageous. 

It is no coincidence that the preferred dimensions of the compartments (droplets of 2.6um 
5 mean diameter) closely resemble those of bacteria, for example, Escherichia are 1.1-1.5 x 
2.0-6.0 urn rods and Azotobacter are 1.5-2.0 um diameter ovoid cells. In its simplest form, 
Darwinian evolution is based on a 'one genotype one phenotype' mechanism. The 
concentration of a single compartmentalised gene, or genome, drops from 0.4 nM in a 
compartment of 2 um diameter, to 25 pM in a compartment of 5 um diameter. The 
10 prokaryotic transcripUon/transIation machinery has evolved to operate in compartments of 
~ 1-2 um diameter, where single genes are at approximately nanomolar concentrations. A 
single gene, in a compartment of 2.6 um diameter is at a concentration of 0.2 nM. This gene 
concentration is high enough for efficient translation. Compartmentalisation in such a volume 
also ensures that even if only a single molecule of the gene product is formed it is present at 
15 about 0.2 nM, which is important if the gene product is to have a modifying activity of the 
genetic element itself. The volume of the microcapsule should thus be selected bearing in 
mind not only the requirements for transcription and translation of the genetic element, but 
also the modifying activity required of the gene product in the method of the invention. 



20 



25 



The size of emulsion microcapsules may be varied simply by tailoring the emulsion 
conditions used to form the emulsion according to requirements of the selection system. The 
larger the microcapsule size, the larger is the volume that will be required to encapsulate a 
given genetic element library, since the ultimately limiting factor will be the size of the 
microcapsule and thus the number of microcapsules possible per unit volume. 



The size of the microcapsules is selected not only having regard to the requirements of the 
transcription/translation system, but also those of the selection system employed for the 
genetic element. Thus, the components of the selection system, such as a chemical 
modification system, may require reaction volumes and/or reagent concentrations which are 
30 not optimal for transcription/translation. As set forth herein, such requirements may be 
accommodated by a secondary re-encapsulation step; moreover, they may be accommodated 
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by selecting the microcapsule size in order to maximise transcription/translation and selection 
as a whole. Empirical determination of optimal microcapsule volume and reagent 
concentration, for example as set forth herein, is preferred. 

A "genetic element" in accordance with the present invention is as described above. 
Preferably, a genetic element is a molecule or construct selected from the group consisting of 
a DNA molecule, an RNA molecule, a partially or wholly artificial nucleic acid molecule 
consisting of exclusively synthetic or a mixture of naturally-occurring and synthetic bases, 
any one of the foregoing linked to a polypeptide, and any one of the foregoing linked to any 
other molecular group or construct. Advantageously, the other molecular group or construct 
may be selected from the group consisting of nucleic acids, polymeric substances, particularly 
beads, for example polystyrene beads, magnetic substances such as magnetic beads, labels, 
such as fluorophores or isotopic labels, chemical reagents, binding agents such as 
macrocycles and the like. 



The nucleic acid portion of the genetic element may comprise suitable regulatory sequences, 
such as those required for efficient expression of the gene product, for example promoters, 
enhancers, translational initiation sequences, polyadenylation sequences, splice sites and the 
like. 



As will be apparent from the following, in many cases the polypeptide or other molecular 
group or construct is a ligand or a substrate which directly or indirectly binds to or reacts 
with the gene product in order to tag the genetic element. This allows the sorting of the 
genetic element on the basis of the activity of the gene product. 

The ligand or substrate can be connected to the nucleic acid by a variety of means that will be 
apparent to those skilled in the art (see, for example, Hermanson, 1996). Any tag will 
suffice that allows for the subsequent selection of the genetic element. Sorting can be by any 
method which allows the preferential separation, amplification or survival of the tagged 
genetic element. Examples include selection by binding (including techniques based on 
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magnetic separation, for example using Dynabeads™), and by resistance to degradation (for 
example by nucleases, including restriction endonucleases). 



One way in which the nucleic acid molecule may be linked to a ligand or substrate is through 
5 biotinylation. This can be done by PCR amplification with a 5'-biotinylation primer such that 
the biotin and nucleic acid are covalently linked. 

The ligand or substrate to be selected can be attached to the modified nucleic acid by a 
variety of means that will be apparent to those of skill in the art. A biotinylated nucleic acid 
10 may be coupled to a polystyrene microbead (0.035 to 0.2nm in diameter) that is coated with 
avidin or streptavidin, that will therefore bind the nucleic acid with very high affinity. This 
bead can be derivatised with substrate or ligand by any suitable method such as by adding 
biotinylated substrate or by covalent coupling. 

1 ^ Alternatively, a biotinylated nucleic acid may be coupled to avidin or streptavidin complexed 
to a large protein molecule such as thyroglobulin (669 Kd) or ferritin (440 Kd). This 
complex can be derivatised with substrate or ligand, for example by covalent coupling to the 
6-amino group of lysines or through a non-covalent interaction such as biotin-avidin. The 
substrate may be present in a form unlinked to the genetic element but containing an inactive 
"tag" that requires a further step to activate it such as photoactivation (e.g. of a "caged" 
biotin analogue, (Sundberg et ai, 1995; Pirrung and Huang, 1996)). The catalyst to be 
selected then converts the substrate to product. The "tag" could then be activated and the 
"tagged" substrate and/or product bound by a tag-binding molecule (e.g. avidin or 
streptavidin) complexed with the nucleic acid. The ratio of substrate to product attached to 
the nucleic acid via the "tag" will therefore reflect the ratio of the substrate and product in 
solution. 



An alternative is to couple the nucleic acid to a product-specific antibody (or other 
product-specific molecule). In this scenario, the substrate (or one of the substrates) is present 
in each microcapsule unlinked to the genetic element, but has a molecular "tag" (for example 
biotin, DIG or DNP). When the catalyst to be selected converts the substrate to product, the 
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product retains the "tag" and is then captured in the microcapsule by the product-specific 
antibody. In this way the genetic element only becomes associated with the "tag" when it 
encodes or produces an enzyme capable of converting substrate to product. 

When all reactions are stopped and the microcapsules are combined, the genetic elements 
encoding active enzymes can be enriched using an antibody or other molecule which binds, 
or reacts specifically with the "tag". Although both substrates and product have the 
molecular tag, only the genetic elements encoding active gene product will co-purify. 

The terms "isolating", "sorting" and "selecting", as well as variations thereof, are used 
herein. Isolation, according to the present invention, refers to the process of separating an 
entity from a heterogeneous population, for example a mixture, such that it is free of at least 
one substance with which it was associated before the isolation process. In a preferred 
embodiment, isolation refers to purification of an entity essentially to homogeneity. Sorting 
of an entity refers to the process of preferentially isolating desired entities over undesired 
entities. In as far as this relates to isolation of the desired entities, the terms "isolating" and 
"sorting" are equivalent. The method of the present invention permits the sorting of desired 
genetic elements from pools (libraries or repertoires) of genetic elements which contain the 
desired genetic element. Selecting is used to refer to the process (including the sorting 
process) of isolating an entity according to a particular property thereof. 

In a highly preferred application, the method of the present invention is useful for sorting 
libraries of genetic elements. The invention accordingly provides a method according to 
preceding aspects of the invention, wherein the genetic elements are isolated from a library 
of genetic elements encoding a repertoire of gene products. Herein, the terms "library", 
"repertoire" and "pool" are used according to their ordinary signification in the art, such 
that a library of genetic elements encodes a repertoire of gene products. In general, 
libraries are constructed from pools of genetic elements and have properties which facilitate 
sorting. 
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Initial selection of a genetic element from a genetic element library using the present 
invention will in most cases require the screening of a large number of variant genetic 
elements. Libraries of genetic elements can be created in a variety of different ways, 
including the following. 



Pools of naturally occurring genetic elements can be cloned from genomic DNA or cDNA 
(Sambrook et aL, 1989) ; for example, phage antibody libraries, made by PCR amplification 
repertoires of antibody genes from immunised or unimmunised donors have proved very 
effective sources of functional antibody fragments (Winter et aL, 1994; Hoogenboom, 1997). 

10 Libraries of genes can also be made by encoding all (see for example Smith, 1985; Parmley 
and Smith, 1988) or part of genes (see for example Lowman et al y 1991) or pools of genes 
(see for example Nissim et al. f 1994) by a randomised or doped synthetic oligonucleotide. 
Libraries can also be made by introducing mutations into a genetic element or pool of genetic 
elements 'randomly* by a variety of techniques in vivo, including; using 'mutator strains', of 

15 bacteria such as E. coli mutDS (Liao et aL, 1986; Yamagishi et aL, 1990; Low et aL. 1996); 
using the antibody hypermutation system of B-lymphocytes (Yelamos et al„ 1995). Random 
mutations can also be introduced both in vivo and in vitro by chemical mutagens, and ionising 
or UV irradiation (see Friedberg et aL, 1995), or incorporation of mutagenic base 
analogues (Freese, 1959; Zaccolo et aL, 1996). 'Random* mutations can also be introduced 

20 into genes in vitro during polymerisation for example by using error-prone polymerases 
(Leung et al. 9 1989). 

Further diversification can be introduced by using homologous recombination either in vivo 
(see Kowalczykowski et aL, 1994 or in vitro (Stemmer, 1994a; Stemmer, 1994b)). 

25 

According to a further aspect of the present invention, therefore, there is provided a method 
of in vitro evolution comprising the steps of: 

(a) selecting one or more genetic elements from a genetic element library according to 
the present invention; 

30 (b) mutating the selected genetic element(s) in order to generate a further library of 

genetic elements encoding a repertoire to gene products; and 
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(c) iteratively repeating steps (a) and (b) in order to obtain a gene product .with 
enhanced activity. 

Mutations may be introduced into the genetic elements(s) as set forth above. 

5 

The genetic elements according to the invention advantageously encode enzymes, preferably 
of pharmacological or industrial interest, activators or inhibitors, especially of biological 
systems, such as cellular signal transduction mechanisms, antibodies and fragments thereof, 
other binding agents suitable for diagnostic and therapeutic applications. In a preferred 
10 aspect, therefore, the invention permits the identification and isolation of clinically or 
industrially useful products. In a further aspect of the invention, there is provided a product 
when isolated by the method of the invention. 

The selection of suitable encapsulation conditions is desirable. Depending on the complexity 
15 and size of the library to be screened, it may be beneficial to set up the encapsulation 
procedure such that 1 or less than 1 genetic element is encapsulated per microcapsule. This 
will provide the greatest power of resolution. Where the library is larger and/or more 
complex, however, this may be impracticable; it may be preferable to encapsulate several 
genetic elements together and rely on repeated application of the method of the invention to 
20 achieve sorting of the desired activity. A combination of encapsulation procedures may be 
used to obtain the desired enrichment. 

Theoretical studies indicate that the larger the number of genetic element variants created the 
more likely it is that a molecule will be created with the properties desired (see Perelson and 
25 Oster, 1979 for a description of how this applies to repertoires of antibodies). Recently it has 
also been confirmed practically that larger phage-antibody repertoires do indeed give rise to 
more antibodies with better binding affinities than smaller repertoires (Griffiths et al., 1994). 
To ensure that rare variants are generated and thus are capable of being selected, a large 
library size is desirable. Thus, the use of optimally small microcapsules is beneficial. 
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The largest repertoire created to date using methods that require an in vivo step 
(phage-display and LacI systems) has been a 1.6 x 10 u clone phage-peptide library which 
required the fermentation of 15 litres of bacteria (Fisch et al., 1996). SELEX experiments 
are often carried out on very large numbers of variants (up to 10 15 ). 

5 

Using the present invention, at a preferred microcapsule diameter of 2.6p.m. a repertoire size 
of at least 10 11 can be selected using 1ml aqueous phase in a 20 ml emulsion. 

In addition to the genetic elements described above, the microcapsules according to the 
10 invention will comprise further components required for the sorting process to take place. 
Other components of the system will for example comprise those necessary for transcription 
and/or translation of the genetic element. These are selected for the requirements of a 
specific system from the following; a suitable buffer, an in vitro transcription/replication 
system and/or an in vitro translation system containing all the necessary ingredients, enzymes 
15 and cofactors, RNA polymerase, nucleotides, nucleic acids (natural or synthetic), transfer 
RNAs, ribosomes and amino acids, and the substrates of the reaction of interest in order to 
allow selection of the modified gene product. 

A suitable buffer will be one in which all of the desired components of the biological system 
20 are active and will therefore depend upon the requirements of each specific reaction system. 
Buffers suitable for biological and/or chemical reactions are known in the art and recipes 
provided in various laboratory texts, such as Sambrook et al., 1989. - 

The in vitro translation system will usually comprise a cell extract, typically from bacteria 
25 (Zubay, 1973; Zubay, 1980; Lesley et al., 1991; Lesley, 1995), rabbit reticulocytes (Pelham 
and Jackson, 1976), or wheat germ (Anderson et al., 1983). Many suitable systems are 
commercially available (for example from Promega) including some which will allow coupled 
transcription/translation (all the bacterial systems and the reticulocyte and wheat germ TNT™ 
extract systems from Promega). The mixture of amino acids used may include synthetic 
30 amino acids if desired, to increase the possible number or variety of proteins produced in the 
library. This can be accomplished by charging tRNAs with artificial amino acids and using 
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these tRNAs for the in vitro translation of the proteins to be selected (Eliman et al., 1991; 
Benner, 1994; Mendel et al., 1995). 

After each round of selection the enrichment of the pool of genetic elements for those 
5 encoding the molecules of interest can be assayed by non-compartmentalised in vitro 
transcription/replication or coupled transcription-translation reactions. The selected pool is 
cloned into a suitable plasmid vector and RNA or recombinant protein is produced from the 
individual clones for further purification and assay. 

0 The invention moreover relates to a method for producing a gene product, once a genetic 
element encoding the gene product has been sorted by the method of the invention. Clearly, 
the genetic element itself may be directly expressed by conventional means to produce the 
gene product. However, alternative techniques may be employed, as will be apparent to 
those skilled in the art. For example, the genetic information incorporated in the gene 

5 product may be incorporated into a suitable expression vector, and expressed therefrom. 

The invention also describes the use of conventional screening techniques to identify 
compounds which are capable of interacting with the gene products identified by the first 
aspect of the invention. In preferred embodiments, gene product encoding nucleic acid is 
incorporated into a vector, and introduced into suitable host cells to produce transformed 
cell lines that express the gene product. The resulting cell lines can then be produced for 
reproducible qualitative and/or quantitative analysis of the effectfs) of potential 4rugs 
affecting gene product function. Thus gene product expressing cells may be employed for 
the identification of compounds, particularly small molecular weight compounds, which 
modulate the function of gene product. Thus host cells expressing gene product are useful 
for drug screening and it is a further object of the present invention to provide a method 
for identifying compounds which modulate the activity of the gene product, said method 
comprising exposing cells containing heterologous DNA encoding gene product, wherein 
said cells produce functional gene product, to at least one compound or mixture of 
compounds or signal whose ability to modulate the activity of said gene product is sought 
to be determined, and thereafter monitoring said cells for changes caused by said 
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modulation. Such an assay enables the identification of modulators, such as agonists, 
antagonists and allosteric modulators, of the gene product. As used herein, a compound or 
signal that modulates the activity of gene product refers to a compound that alters the 
activity of gene product in such a way that the activity of gene product is different in the 
presence of the compound or signal (as compared to the absence of said compound or 
signal). 

Cell-based screening assays can be designed by constructing cell lines in which the 
expression of a reporter protein, i.e. an easily assayable protein, such as b galactosidase, 
chloramphenicol acetyltransferase (CAT) or luciferase, is dependent on gene product. Such 
an assay enables the detection of compounds that directly modulate gene product function, 
such as compounds that antagonise gene product, or compounds that inhibit or potentiate 
other cellular functions required for the activity of gene product. 

15 The present invention also provides a method to exogenously affect gene product dependent 
processes occurring in cells. Recombinant gene product producing host cells, e.g. 
mammalian cells, can be contacted with a test compound, and the modulating effect(s) 
thereof can then be evaluated by comparing the gene product-mediated response in the 
presence and absence of test compound, or relating the gene product-mediated response of 
test cells, or control cells (i.e., cells that do not express gene product), to the presence of 
the compound. 



10 



20 



In a further aspect, the invention relates to a method for optimising a production process 
which involves at least one step which is facilitated by a polypeptide. For example, the 
25 step may be a catalytic step, which is facilitated by an enzyme. Thus, the invention 
provides a method for preparing a compound or compounds comprising the steps of: 

(a) providing a synthesis protocol wherein at least one step is facilitated by a 
polypeptide; 

(b) preparing genetic elements encoding variants of the polypeptide which facilitates 
30 this step; 

(c) compartmentalising the genetic elements into microcapsules; 



WO99/02671 PCT,GB98,0,889 

27 

(d) expressing the genetic elements to produce their respective gene products within 
the microcapsules; 

(e) sorting the genetic elements which produce polypeptide gene product(s) having 
the desired activity; and 

(0 preparing the compound or compounds using the polypeptide gene product 
identified in (e) to facilitate the relevant step of the synthesis. 

By means of the invention, enzymes involved in the preparation of a compound may be 
optimised by selection for optimal activity. The procedure involves the preparation of 
variants of the polypeptide to be screened, which equate to a library of polypeptides as 
refereed to herein. The variants may be prepared in the same manner as the libraries 
discussed elsewhere herein. 



(B) SELECTION PROCEDURES 



The system can be configured to select for RNA, DNA or protein gene product molecules 
with catalytic, regulatory or binding activity. 



(i) AFFINITY SELECTION 



In the case of selection for a gene product with affinity for a specific ligand the genetic 
element may be linked to the gene product in the microcapsule via the ligand. Only gene 
products with affinity for the ligand will therefore bind to the genetic element itself and 
therefore only genetic elements that produce active product will be retained in the selection 
step. In this embodiment, the genetic element will thus comprise a nucleic acid encoding the 
gene product linked to a ligand for the gene product. 

In this embodiment, all the gene products to be selected contain a putative binding domain, 
which is to be selected for, and a common feature - a tag. The genetic element in each 
microcapsule is physically linked to the ligand. If the gene product produced from the 



WO 99/02671 T>m,™ n «,^„» n 

PCT/GB98/01889 

28 

genetic element has affinity for the ligand, it will bind to it and become physically linked to 
the same genetic element that encoded it, resulting in the genetic element being 'tagged'. At 
the end of the reaction, all of the microcapsules are combined, and all genetic elements and 
gene products pooled together in one environment. Genetic elements encoding gene products 
exhibiting the desired binding can be selected by affinity purification using a molecule that 
specifically binds to, or reacts specifically with, the "tag". 



In an alternative embodiment, genetic elements may be sorted on the basis that the gene 
product, which binds to the ligand, merely hides the ligand from, for example, further 
10 binding partners. In this eventuality, the genetic element, rather than being retained during 
an affinity purification step, may be selectively eluted whilst other genetic elements are 
bound. 



In an alternative embodiment, the invention provides a method according to the first aspect 
15 of the invention, wherein in step (b) the gene products bind to genetic elements encoding 
them. The gene products together with the attached genetic elements are then sorted as a 
result of binding of a ligand to gene products having the desired activity. For example, all 
gene products can contain an invariant region which binds covalently or non-covalently to the 
genetic element, and a second region which is diversified so as to generate the desired 
20 binding activity. 

Sorting by affinity is dependent on the presence of two members of a binding pair in-such 
conditions that binding may occur. Any binding pair may be used for this purpose. As used 
herein, the term binding pair refers to any pair of molecules capable of binding to one 

25 another. Examples of binding pairs that may be used in the present invention include an 
antigen and an antibody or fragment thereof capable of binding the antigen, the biotin- 
avidin/streptavidin pair (Savage et al, 1994), a calcium-dependent binding polypeptide and 
ligand thereof (e.g. calmodulin and a calmodulin-binding peptide (Stofko et al. 9 1992; 
Montigiani et aL ,1996)), pairs of polypeptides which assemble to form a leucine zipper 

30 (Tripet et al., 1996), histidines (typically hexahistidine peptides) and chelated Cu 2+ , Zn 2+ and 
Ni 2+ , (e.g. Ni-NTA; Hochuli et aL, 1987), RNA-binding and DNA-binding proteins (Klug, 
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1995) including those containing zinc-finger motifs (Klug and Schwabe, 1995) and DNA 
methyltransferases (Anderson, 1993), and their nucleic acid binding sites. 



(ii) CATALYSIS 

When selection is for catalysis, the genetic element in each microcapsule may comprise the 
substrate of the reaction. If the genetic element encodes a gene product capable of acting as a 
catalyst, the gene product will catalyse the conversion of the substrate into the product. 
Therefore, at the end of the reaction the genetic element is physically linked to the product of 
the catalysed reaction. When the microcapsules are combined and the reactants pooled, 
genetic elements encoding catalytic molecules can be enriched by selecting for any property 
specific to the product (Figure 1). 



For example, enrichment can be by affinity purification using a molecule (e.g. an antibody) 
that binds specifically to the product. Equally, the gene product may have the effect of 
modifying a nucleic acid component of the genetic element, for example by methylation (or 
demethylation) or mutation of the nucleic acid, rendering it resistant to or susceptible to 
attack by nucleases, such as restriction endonucleases. 

Alternatively, selection may be performed indirectly by coupling a first reaction to 
subsequent reactions that takes place in the same microcapsule. There are two general ways 
in which this may be performed. First, the product of the first reaction could be reacted 
with, or bound by, a molecule which does not react with the substrate of the first reaction. A 
second, coupled reaction will only proceed in the presence of the product of the first reaction. 
An active genetic element can then be purified by selection for the properties of the product 
of the second reaction. 



Alternatively, the product of the reaction being selected may be the substrate or cofactor 
for a second enzyme-catalysed reaction. The enzyme to catalyse the second reaction can 
either be translated in situ in the microcapsules or incorporated in the reaction mixture prior 
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to microencapsulation. Only when the first reaction proceeds will the coupled enzyme 
generate a selectable product. 



This concept of coupling can be elaborated to incorporate multiple enzymes, each using as a 
5 substrate the product of the previous reaction. This allows for selection of enzymes that will 
not react with an immobilised substrate. It can also be designed to give increased sensitivity 
by signal amplification if a product of one reaction is a catalyst or a cofactor for a second 
reaction or series of reactions leading to a selectable product ( for example, see Johannsson 
and Bates, 1988; Johannsson, 1991). Furthermore an enzyme cascade system can be based 
10 on the production of an activator for an enzyme or the destruction of an enzyme inhibitor (see 
Mize et aL y 1989). Coupling also has the advantage that a common selection system can be 
used for a whole group of enzymes which generate the same product and allows for the 
selection of complicated chemical transformations that cannot be performed in a single step. 

15 Such a method of coupling thus enables the evolution of novel "metabolic pathways" in vitro 
in a stepwise fashion, selecting and improving first one step and then the next. The selection 
strategy is based on the final product of the pathway, so that all earlier steps can be evolved 
independently or sequentially without setting up a new selection system for each step of the 
reaction. 



Expressed in an alternative manner, there is provided a method of isolating one or more 
genetic elements encoding a gene product having a desired catalytic activity, comprising the 
steps of: 

(1) expressing genetic elements to give their respective gene products; 

(2) allowing the gene products to catalyse conversion of a substrate to a product, 
which may or may not be directly selectable, in accordance with the desired activity; 

(3) optionally coupling the first reaction to one or more subsequent reactions, each 
reaction being modulated by the product of the previous reactions, and leading to the creation 
of a final, selectable product; 

(4) linking the selectable product of catalysis to the genetic elements by either: 
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a) coupling a substrate to the genetic elements in such a way that the 
product remains associated with the genetic elements, or 

b) reacting or binding the selectable product to the genetic elements by 
way of a suitable molecular "tag" attached to the substrate which remains on 
the product, or 

c) coupling the selectable product (but not the substrate) to the genetic 
elements by means of a product-specific reaction or interaction with the 
product; and 

(5) selecting the product of catalysis, together with the genetic element to which it is 
bound, either by means of a specific reaction or interaction with the product, or by affinity 
purification using a suitable molecular "tag" attached to the product of catalysis, wherein 
steps (1) to (4) each genetic element and respective gene product is contained within a 
microcapsule. 



15 (Hi) REGULATION 

A similar system can be used to select for regulatory properties of enzymes. 

In the case of selection for a regulator molecule which acts as an activator or inhibitor of a 
20 biochemical process, the components of the biochemical process can either be translated in 
situ in each microcapsule or can be incorporated in the reaction mixture prior to 
microencapsulation. If the genetic element being selected is to encode an activator, selection 
can be performed for the product of the regulated reaction, as described above in connection 
with catalysis. If an inhibitor is desired, selection can be for a chemical property specific to 
25 the substrate of the regulated reaction. 



There is therefore provided a method of sorting one or more genetic elements coding for a 
gene product exhibiting a desired regulatory activity, comprising the steps of: 
(1) expressing genetic elements to give their respective gene products; 
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(2) allowing the gene products to activate or inhibit a biochemical reaction, or 
sequence of coupled reactions, in accordance with the desired activity, in such a way as to 
allow the generation or survival of a selectable molecule; 

(3) linking the selectable molecule to the genetic elements either by 

a) having the selectable molecule, or the substrate from which it derives, attached 
to the genetic elements, or 

b) reacting or binding the selectable product to the genetic elements, by way of a 
suitable molecular "tag" attached to the substrate which remains on the product, or 

c) coupling the product of catalysis (but not the substrate) to the genetic elements, 
by means of a product-specific reaction or interaction with the product; 

(4) selecting the selectable product, together with the genetic element to which it is 
bound, either by means of a specific reaction or interaction with the selectable product, or 
by affinity purification using a suitable molecular "tag" attached to the product of catalysis, 
wherein steps (1) to (4) each genetic element and respective gene product is contained within 
a microcapsule. 

(iv) MICROCAPSULE SORTING 

The invention provides for the sorting of intact microcapsules where this is enabled by the 
sorting techniques being employed. Microcapsules may be sorted as such when the change 
induced by the desired gene product either occurs or manifests itself at the surface of the 
microcapsule or is detectable from outside the microcapsule. The change may be caused by 
the direct action of the gene product, or indirect, in which a series of reactions, one or 
more of which involve the gene product having the desired activity leads to the change. 
For example, the microcapsule may be so configured that the gene product is displayed at its 
surface and thus accessible to reagents. Where the microcapsule is a membranous 
microcapsule, the gene product may be targeted or may cause the targeting of a molecule to 
the membrane of the microcapsule. This can be achieved, for example, by employing a 
membrane localisation sequence, such as those derived from membrane proteins, which will 
favour the incorporation of a fused or linked molecule into the microcapsule membrane. 
Alternatively, where the microcapsule is formed by phase partitioning such as with water-in- 
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oil emulsions, a molecule having parts which are more soluble in the extra-capsular phase 
will arrange themselves such that they are present at the boundary of the microcapsule. 

In a preferred aspect of the invention, however, microcapsule sorting is applied to sorting 
systems which rely on a change in the optical properties of the microcapsule, for example 
absorption or emission characteristics thereof, for example alteration in the optical properties 
of the microcapsule resulting from a reaction leading to changes in absorbance, luminescence, 
phosphorescence or fluorescence associated with the microcapsule. All such properties are 
included in the term "optical". In such a case, microcapsules can be sorted by luminescence, 
fluorescence or phosphorescence activated sorting. In a highly preferred embodiment, 
fluorescence activated sorting is employed to sort microcapsules in which the production of a 
gene product having a desired activity is accompanied by the production of a fluorescent 
molecule in the cell. For example, the gene product itself may be fluorescent, for example a 
fluorescent protein such as GFP. Alternatively, the gene product may induce or modify the 
15 fluorescence of another molecule, such as by binding to it or reacting with it. 

(v) MICROCAPSULE IDENTIFICATION 

Microcapsules may be identified by virtue of a change induced by the desired gene product 
20 which either occurs or manifests itself at the surface of the microcapsule or is detectable 
from the outside as described in section iii (Microcapsule Sorting). This change, when 
identified, is used to trigger the modification of the gene within the compartment: -In a 
preferred aspect of the invention, microcapsule identification relies on a change in the 
optical properties of the microcapsule resulting from a reaction leading to luminescence, 
25 phosphorescence or fluorescence within the microcapsule. Modification of the gene within 
the microcapsules would be triggered by identification of luminescence, phosphorescence 
or fluorescence. For example, identification of luminescence, phosphorescence or 
fluorescence can trigger bombardment of the compartment with photons (or other particles 
or waves) which leads to modification of the genetic element. A similar procedure has 
30 been described previously for the rapid sorting of cells (Keij et al, 1994). Modification of 
the genetic element may result, for example, from coupling a molecular "tag", caged by a 
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photolabile protecting group to the genetic elements: bombardment with photons of an 
appropriate wavelength leads to the removal of the cage. Afterwards, all microcapsules are 
combined and the genetic elements pooled together in one environment. Genetic elements 
encoding gene products exhibiting the desired activity can be selected by affinity 
purification using a molecule that specifically binds to, or reacts specifically with, the 
"tag". 



(vi) MULTI-STEP PROCEDURE 

10 It will be also be appreciated that according to the present invention, it is not necessary for all 
the processes of transcription/replication and/or translation, and selection to proceed in one 
single step, with all reactions taking place in one microcapsule. The selection procedure may 
comprise two or more steps. First, transcription/replication and/or translation of each genetic 
element of a genetic element library may take place in a first microcapsule. Each gene 

15 product is then linked to the genetic element which encoded it (which resides in the same 
microcapsule). The microcapsules are then broken, and the genetic elements attached to their 
respective gene products optionally purified. Alternatively, genetic elements can be attached 
to their respective gene products using methods which do not rely on encapsulation. For 
example phage display (Smith, G.P.,1985), polysome display (Mattheakkis et al. 9 1994), 
20 RNA-peptide fusion (Roberts and Szostak, 1997) or lac repressor peptide fusion (Cull, et 
al. 9 1992). 

In the second step of the procedure, each purified genetic element attached to its gene product 
is put into a second microcapsule containing components of the reaction to be selected. This 

25 reaction is then initiated. After completion of the reactions, the microcapsules are again 
broken and the modified genetic elements are selected. In the case of complicated multistep 
reactions in which many individual components and reaction steps are involved, one or more 
intervening steps may be performed between the initial step of creation and linking of gene 
product to genetic element, and the final step of generating the selectable change in the 

30 genetic element. 
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(vii) SELECTION BY ACTIVATION OF REPORTER GENE EXPRESSION IN SITU 

The system can be configured such that the desired binding, catalytic or regulatory activity 
encoded by a genetic element leads, directly or indirectly to the activation of expression of 
5 a "reporter gene" that is present in all microcapsules. Only gene products with the desired 
activity activate expression of the reporter gene. The activity resulting from reporter gene 
expression allows the selection of the genetic element (or of the compartment containing it) 
by any of the methods described herein. 

10 For example, activation of the reporter gene may be the result of a binding activity of the 
gene product in a manner analogous to the "two hybrid system" (Fields and Song, 1989). 
Activation might also result from the product of a reaction catalysed by a desirable gene 
product. For example, the reaction product could be a transcriptional inducer of the 
reporter gene. For example arabinose could be used to induce transcription from the 

15 araBAD promoter. The activity of the desirable gene product could also result in the 
modification of a transcription factor, resulting in expression of the reporter gene. For 
example, if the desired gene product is a kinase or phosphatase the phosphorylation or 
dephosphorylation of a transcription factor may lead to activation of reporter gene 
expression. 



20 



(viii) AMPLIFICATION 



According to a further aspect of the present invention the method comprises the further step 
of amplifying the genetic elements. Selective amplification may be used as a means to enrich 
25 for genetic elements encoding the desired gene product. 



In all the above configurations, genetic material comprised in the genetic elements may be 
amplified and the process repeated in iterative steps. Amplification may be by the 
polymerase chain reaction (Saiki et al., 1988) or by using one of a variety of other gene 
amplification techniques including; Qp replicase amplification (Cahill, Foster and Mahan, 
1991; Chetverin and Spirin, 1995; Katanaev, Kurnasov and Spirin, 1995); the ligase chain 
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reaction (LCR) (Landegren et aL, 1988; Barany, 1991); the self-sustained sequence 
replication system (Fahy, Kwoh and Gingeras, 1991) and strand displacement amplification 
(Walker etal., 1992). 



5 (ix) COMPARTMENT ALISATION 

According to a further aspect of the present invention, there is provided a method for 
compartmentalising a genetic element and expressing the genetic element to form its gene 
product within the compartment, comprising the steps of: 

(a) forming an aqueous solution comprising the genetic element and the components 
necessary to express it to form its gene product; 

(b) microencapsulating the solution so as to form a discrete microcapsule 
comprising the genetic element; and 

(c) exposing the microcapsule to conditions suitable for the expression of the genetic 
15 element to form its gene product to proceed. 

Suitable microencapsulation techniques are described in detail in the foregoing general 
description. 



10 



Preferably, a library of genetic elements encoding a repertoire of gene products is 
encapsulated by the method set forth above, and the genetic elements expressed to produce 
their respective gene products, in accordance with the invention. In a highly preferred 
embodiment, microencapsulation is achieved by forming a water-in-oil emulsion of the 
aqueous solution comprising the genetic elements. 

The invention, accordingly, also provides a microcapsule obtainable by the method set forth 
above. 
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Various aspects and embodiments of the present invention are illustrated in the following 
examples. It will be appreciated that modification of detail may be made without departing 
from the scope of the invention. 



5 All documents mentioned in the text are incorporated by reference. 



EXAMPLES 
10 Example 1. 

The production of approx. 2um aqueous microcapsules in a water-in-oil emulsion 
system. 

Microcapsules within the preferred size range of the present invention can be generated using 
15 a water-in-oil emulsion system. 

Light white mineral oil (Sigma; M-3516) is used herein as the continuous phase and the 
emulsion is stabilised by emulsifiers sorbitan monooleate (Span 80, Fluka; 85548) and 
polyoxyethylenesorbitan monooleate (Tween 80, Sigma Ultra; P-8074) and in some cases 
20 also with 0.5% w/v sodium deoxycholate (Fluka; 30970). 

The oil phase is freshly prepared by dissolving 4.5% (v/v) Span 80 (Fluka) in mineral oil 
(Sigma, #M-5904) followed by 0.5% (v/v) Tween 80 (SigmaUltra; #P-8074). Ice-cooled in 
vitro reaction mixtures (50 ul) are added gradually (in 5 aliquots of 10 ul over -2 minutes) 

25 to 0.95 ml of ice-cooled oil-phase in a 5 ml Costar Biofreeze Vial (#2051) whilst stirring with 
a magnetic bar (8x3 mm with a pivot ring; Scientific Industries International, Loughborough, 
UK). Stirring (at 1150 rpm) is continued for an additional 1 minute on ice. In some 
emulsions the aqueous phase is supplemented with an anionic surfactant - e.g., sodium 
deoxycholate, sodium cholate, sodium glycocholate, and sodium taurocholate, typically to 

30 0.5% (w/v). 



10 



20 



25 



WO 99/02671 _ 

PCT/GB98/01889 

38 

When indicated, the emulsion is farther homogenised using an Ultra-Turrax T25 disperser 
(IKA) equipped with an 8 mm diameter dispersing tool at 8k, 9k or 13.5k rpm for 1 minute, 
or at 20k rpm for 1 or 5 minutes, on ice. This reduces the microcapsule size. 

The reactions may be quenched and the emulsion broken as indicated in individual examples, 
by spinning at 3,000 g for 5 minutes and removing the oil phase, leaving the concentrated 
emulsion at the bottom of the vial. Quenching buffer (typically, 0.2 ml of 25 ug/ml yeast 
RNA in W+B buffer: 1 M NaCl, 10 mM Tris-HCl, 1 mM EDTA pH 7.4) and 2 ml of 
water-saturated diethyl ether is added and the mixture vortexed, centrifuged briefly, and the 
ether phase removed. The aqueous phase is washed with ether and dried (5 minutes in a 
Speedvac at ambient temperature). 



The size distribution of the aqueous droplets in the emulsions was determined by laser 
diffraction using a Coulter LS230 Particle Size Analyser. An aliquot of emulsion, freshly 
15 diluted (1:10) in mineral oil is added to the micro-volume chamber containing stirred mineral 
oil. Results are analysed with the instrument's built-in Mie optical model using refractive 
indices of 1.468 for mineral oil and 1.350 for the aqueous phase. The size distribution of 
the aqueous droplets in the emulsion is shown in Figure 2. Addition of sodium 
deoxycholate does not significanUy alter the size distribution. 



Example 2. 

Efficient in vitro transcription reactions performed in the aqueous microcapsules of a 
water-in-oO emulsion. 

In order to produce RNA from DNA within each microcapsule, the single molecule of DNA 
present within each aqueous microcapsule of the system must be transcribed efficiently. 
Herein, in vitro transcription is demonstrated within microcapsules. 



30 



The catalytic core of the Tetrahymena self-splicing intron is a much-studied ribozyme which 
can catalyse a variety of phosphoester transfer reactions (Sag a al., 1986; Sag and Czech, 
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1986; Sag and Czech, 1986). For example, a modified Tetrahymena intron missing the PI 
stem-loop from the 5'-end, and missing the 3' stem-loops P9.1 and P9.2 can function as an 
RNA ligase, efficiently splicing together two or more oligonucleotides aligned on a template 
strand (Green and Szostak, 1992). 



DNA encoding the above-described Tetrahymena ribozyme is PCR-amplified using primers 
P2T7Ba (which anneals to the P2 loop region and appends a 17 RNA polymerase promoter) 
and P9Fo (which anneals to the P9 loop region). This creates a 331 base pair DNA fragment 
carrying the 17 RNA polymerase promoter. This fragment is purified directly using Wizard 
PCR Preps (Promega) and used as the template for an in vitro transcription reaction using T7 
RNA polymerase. 



In vitro transcription is assayed over an initial 10 minute period during which the reaction 
rate is essentially linear (Chamberlin and Ring, 1973). Reaction conditions for transcription 
are as described by Wyatt et al. , 1991 . 

Incorporation of [y'^P] UTP is used to assay the progression of the reaction. 

A transcription reaction is set up in a volume of 200ul and divided into 2 aliquots, each 
containing 3 x 10" molecules of DNA (5nM). One lOOul aliquot is added to 2ml Sigma light 
mineral oil containing 4.5% Span 80 and 0.5% Tween 80 and homogenised for 5 minutes 
with an Ultra-Turrax T25 disperser at 20,000 rpm as in Example 1. Based on the mean 
microcapsule volume in these emulsions (2.8 X 10" 19 m 3 for a 0.81um diameter microcapsule) 
the lOOul reaction would be divided into 3.6 x 10" microcapsules. Hence, there should be 1 
molecule of DNA per microcapsule on average. 

Both aliquots are incubated in a 37°C water bath. 0.5 ml samples of the emulsion are 
removed both before the start of the incubation and after 10 minutes and placed on ice. 
Similar 25ul samples are removed from the non-emulsified control reactions at the same time. 
Emulsions are broken and reactions stopped with 0.5 ml EDTA (50 mM) and 2 ml water- 
saturated diethyl ether as described in Example 1. IOOuJ salmon sperm DNA (500ng/ml) in 
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20 mM EDTA is then added. Three IOOjiI aliquots are then removed from both emulsions 
and controls and labelled RNA is assayed by TCA precipitation and scintillation counting. 

The rate of transcription is taken as the increase in acid perceptible cpm over the 10 minute 
5 incubation at 37°C. In the non emulsified control reaction there are 442,000 cpm acid 
perceptible material compared to 147,000 cpm in the emulsion. Hence the rate of 
transcription in the emulsion is 33% of that found in the non-emulsified control reaction. 

This procedure therefore shows that RNA can be efficiently synthesised by T7 RNA 
10 polymerase in the aqueous microcapsules of a water-in-oil emulsion. 

Example 3. 

Efficient coupled in vitro transcription/translation reactions performed in the aqueous 
microcapsules of a water-in-oil emulsion. 

15 

In order to synthesise proteins using the procedure of the present invention, translation must 
be active in the aqueous microcapsules of the water-in-oil emulsion described herein. 

Here it is shown how a protein (E. coli dihydrofolate reductase) can be efficiently produced 
20 from DNA in the aqueous microcapsules of a water-in-oil emulsion system using a coupled 
transcription/translation system. 

The E. coli folk gene encoding dihydrofolate reductase (DHFR) is PCR-amplified using 
oligonucleotides EDHFRFo and EDHFRBa. This DNA is then cloned into the pGEM-4Z 
25 vector (Promega) digested with HindHl and Kpnl downstream of the both the lac promoter 
and the T7 RNA polymerase promoter. The oligonucleotide EDHFRBa appends the efficient 
phage T7 gene 10 translational start site upstream of the DHFR start codon. 

DNA sequencing identifies a clone which has the correct nucleotide sequence. Bacteria 
30 transformed with this clone (pGEM-/tf/A) are found to over express active DHFR (driven 
from the lac promoter) when induced with IPTG. 



WO 99/02671 



41 



PCT/GB98/01889 



The pGEM-/o/A plasmid is then PCR-amplified using primers LMB2 and LMB3 under the 
conditions described above to create a 649bp DNA fragment carrying the T7 RNA 
polymerase promoter, the phage T7 gene 10 translation^ start site and the/o/A gene. This 
PCR fragment is purified directly using Wizard PCR Preps (Promega) and used to program a 
prokaryotic in vitro coupled transcription/translation system designed for linear templates 
(Lesley, Brow and Burgess, 1991). 

A commercial preparation of this system is used (E. coU S30 Extract System for Linear 
Templates; Promega) supplemented with T7 RNA polymerase. 

A 300ul translation reaction is set up on ice containing 3 x 10 12 molecules of DNA. T7 RNA 
polymerase (10 4 units) is added to drive transcription and the translated protein is labelled by 
the addition of [ 3S S] methionine. A 150ul aliquot of this reaction is added to 2.85 ml Sigma 
light mineral oil containing 4.5% Span 80 and 0.5% Tween 80 and homogenised for 1 
minute with an Ultra-Turrax T25 disperser at 20,000 rpm, as in Example 1. The other 
aliquot is not emulsified. 

Based on the mean microcapsule volume in the emulsions (1.1 x 10 l8 m 3 for a 1.29um 
diameter microcapsule) the 150uJ reaction would be divided into 1.3 x 10", microcapsules). 
Hence, there should be roughly 11 molecules of DNA per microcapsule. 

Four 0.5ml aliquots are removed from the emulsion reaction mix. One aliquot is immediately 
put on ice and the other three are incubated in a 25°C water bath for 2 hours before being 
placed on ice. Four 25jil samples are also removed from the non-emulsified reaction mix; 
one is put immediately on ice and the other three are incubated in a 25°C water bath for 2 
hours and then placed on ice. 

The emulsions are spun in a microfuge at 13,000 r.p.m. for 5 minutes at 4°C and the mineral 
oil removed leaving the concentrated (but still intact) emulsion at the bottom of the tube. 
After briefly re-spinning and removing any further mineral oil, the emulsion is broken and 
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any further translation stopped by adding lOOjil water containing 125ug/ml puromycin, and 1 
ml water saturated diethyl ether. This mixture is vortexed and respun in a microfuge at 
13,000 r.p.m. for 1 minute at 4°C. The ether and dissolved mineral oil is then removed by 
aspiration and the extraction repeated with a further 1 ml of ether. Any remaining ether is 
5 driven off by spinning for 5 minutes in a Speedvac at room temperature. 

lOOul water containing 125ug/ml puromycin is also added to the 25ul non-emulsified 
control reactions. 25ul of each of the samples is then precipitated with acetone and run on a 
20% SDS-PAGE gel according to the instructions given by the manufacturers of the in vitro 
10 transcription/translation system (Promega). The gel is dried and scanned using a 
Phosphorlmager (Molecular Dynamics). A single strong band is seen with the expected 
molecular weight of DHFR (18 kd) in both the reactions performed in emulsions and in the 
controls. This band is accurately quantified. 

15 In the emulsified reactions the mean area under the 18kd peak is 15,073 units whereas the 
mean area under the same peak in the non-emulsified control reactions is 18,990 units. 
Hence, in the emulsified reactions the amount of DHFR protein is calculated to be 79% that 
found in the nonemulsified control reactions. This therefore indicates that the 
transcription/translation system is functional in the water-in-oil emulsion system of the 

20 present invention. 



Example 4. 

Dihydrofolate reductase produced using the coupled in vitro transcription/translation 
25 reactions is active. 

Here it is shown that protein (E. coli dihydrofolate reductase) can be produced efficiently in a 
catalytically active form by coupled transcription/translation of the/o/A gene in the aqueous 
microcapsules of a water-in-oil emulsion system. In this assay, an emulsion comprising 
30 microcapsules below optimal size is used; DHFR activity is shown to be higher in the larger 
microcapsule sizes. 
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175,11 translation reactions (unlabelled) are set up on ice containing either 2 x 10 n , 6 x 10 12 
or 1.8 x 10 12 molecules of the/o/A template DNA used in Example 3, or no DNA. T7 RNA 
polymerase (6 x 10 3 units) are added to each reaction to drive transcription. 

5 

A 100m aliquot of each reaction is added to 1.9ml Sigma light mineral oil containing 4.5% 
Span 80 and 0.5% Tween 80 and homogenised for 1 minute or 5 minutes with an Ultra- 
Turrax T25 Homogeniser equipped with an 8mm diameter dispersing tool, at 20,000 rpm as 
in Example 1. After homogenisation for 1 minute the mean diameter of particles (by volume) 
10 is 1.30pm (median 1.28pm).. 98% by volume of the internal (aqueous) phase is present in 
particles varying from 0.63pm to 2.12pm. After homogenisation for 5 minutes the mean 
diameter of microcapsules (by volume) is 0.81pm (median 0.79pm) and 98% by volume of 
the internal (aqueous) phase is present in particles varying from 0.41pm to 1.38pM. 



15 



20 



25 



Based on the mean microcapsule volume in the 1 minute emulsions (1.1 x 10" 18 m 3 for a 
1.299 pm diameter microcapsule) the lOOpl reaction would be divided into 8.7 x 10 10 
microcapsules). Hence, there should be roughly 1.3, 3.9 or 11.8 molecules of DNA per 
microcapsule. 

Based on the mean microcapsule volume in the 5 min emulsions (2.8 x 10- 19 M 3 for a 
0.81pm diameter microcapsule) the lOOpl reaction would be divided into 3.6 x-10 11 
microcapsules). Hence, there should be roughly 0.3, 1.0 or 2.9 molecules of DNA per 
microcapsule. 

The emulsions, and the non-emulsified reaction mix are incubated in a 25°C water bath. 0.5 
ml samples of the emulsion are removed immediately before the start of the incubation and 
after 2 hours and placed on ice. 25pl samples are removed from»the non-emulsified control 
reactions at the same times. 
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The emulsions are spun in a microfuge at 13,000 r.p.m. for 5 min. at 4°C and the mineral oil 
removed by aspiration, leaving the concentrated (but still intact) emulsion at the bottom of the 
tube. After briefly re-spinning and removing any further mineral oil the emulsion is broken 
and any further translation stopped by adding lOOfil Buffer A (100 mM Imidazole pH 7.0, 10 
5 mM p-mercaptoethanol), containing 125jig/ml puromycin and 1ml water saturated diethyl 
ether. The mixture is vortexed and spun in a microfuge at 13,000 r.p.m. for 1 min. at 4°C. 
The ether and dissolved mineral oil is removed by aspiration and the extraction repeated with 
a further 1ml of ether. Any remaining ether is driven off by spinning for 5 minutes in a 
Speedvac at room temperature. lOOjxl Buffer A containing (125ng/ml) puromycin is also 
10 added to the 25jxl non-emulsified control reactions. 

Dihydrofolate reductase activity is assayed as by spectrophotometrically monitoring the 
oxidation of NADPH to NADP at 340nm over a 10 minute time course as described by 
Williams et aL y 1979; Ma et aL, 1993. lOfil of each quenched in vitro translation reaction is 
15 added to 150^1 Buffer A (100 mM Imidazole, pH 7.0, 10 mM P-mercaptoethanol) and 20^1 
ImM NADPH. 20pl Dihydrofolate (lmM)(H 2 F) is added after 1 minute and the reaction 
monitored at 340nm using a ThermoMax microplate reader (Molecular Devices). Activity is 
calculated by initial velocities under So> >Km conditions (u^). The background activity 
in the S30 extract is subtracted from all samples. 

20 

DHFR activity generated in the emulsions is taken from the difference in activity measured at 
0 hours and 2 hours incubation. No increase in NADPH oxidation occurred between the 0 
hour and 2 hour samples when 0.1|iM methotrexate (a specific inhibitor of DHFR) is added 
25 showing that all the increase in NADPH oxidation observed is due to DHFR produced in the 
in vitro translation reactions. 



Using 1 minute homogenisation at 20,000 rpm, the DHFR activity generated in the emulsions 
is 31% that found in the non-emulsified control reactions with 1.3 molecules of DNA per 
30 microcapsule; 45% with 3.9 molecules of DNA per microcapsule; and 84% with 11.8 
molecules of DNA per microcapsule. 
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Using 5 minute homogenisation at 20,000 rpm, the DHFR activity generated in the emulsions 
is 7% that found in the non-emulsified control reactions with 0.3 molecules of DNA per 
microcapsule; 15% with 1 molecule of DNA per microcapsule; and 35% with 2.9 molecules 
5 of DNA per microcapsule, on average. 

Assuming the turnover number of DHFR is as described by Posner et al., 1996, this- 
corresponds to a yield at the highest DNA concentration of 6.3 ng (340pmole) DHFR per 
lOOpil reaction (non-emulsified control), 1.9fyg (104pmole) DHFR per lOOjxl reaction 
10 (emulsified for 1 min), or 0.46ng (24.8pmole) per lOOfil reaction (emulsified for 5 minutes). 
This equates to 74 molecules DHFR per microcapsule in the 1 minute emulsions and 44 
molecules per microcapsule in the 5 minute emulsions (assuming that all microcapsules are of 
mean size). 

15 The DHFR activity resulting from coupled transcription/translation of folA genes is also 
measured in the larger microcapsules produced by stirring alone, or by stirring followed by 
farther homogenisation with an Ultra-Turrax T25 disperser at 8,000 rpm, 9,000 rpm, or 
13,500 rpm for 1 minute as described in Example 1. The results are presented in Figure 
2b. The concentration of/o/A genes used (2.5 nM) gives an average of 1, 1.5 and 4.8 

20 genetic elements per droplet in the emulsions homogenised at 13,500 rpm, 9,500 rpm and 
8,000 rpm, respectively, and an average of 14 genetic element per droplet in the emulsion 
prepared by stirring only. Addition of sodium deoxycholate (0.5%) to the in vitro 
translation reaction mixture does not significantly affect the DHFR activity observed in the 
broken emulsions. 

25 

Example 5. 

Linkage of an immobilised substrate into a genetic element via a high molecular weight 
protein. 

30 In order to link multiple immobilised substrate molecules to a DNA fragment comprising the 
folk gene, the DNA fragment is first biotinylated and then coupled to a complex of avidin 
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with apoferritin. Horse spleen apoferritin is a large, near spherical protein molecule of 
12.5nm diameter which therefore provides multiple sites which can be derivatised with 
substrate (e.g. the e-amino group of surface lysines). The pGEM-/o/A plasmid encoding E. 

coli DHFR is PCR amplified using the primers LMB3 and 5'-biotinylated LMB2 (LMB2- 
5 Biotin) to create a biotinylated 649bp DNA fragment carrying the T7 RNA polymerase 

promoter, the phage 17 gene 10 translation^ start site and the folk gene (see Example 3). 

The DNA is radiolabeled by supplementing the 500p.l PCR reaction mix with lOOuCi 

[a- 32 P]dCTP (Amersham; 3000 Ci/mmol). The biotinylated PCR fragment is purified 

directly using Wizard PCR Preps (Promega) and the concentration determined 
10 spectrophotometrically. The percentage of DNA biotinylated is assayed by binding to 

Streptavidin M-280 Dynabeads (Dynal) and scintillation counting. 83% of the DNA is 

determined to be biotinylated using this technique. 

The sequestered iron is removed from a commercial conjugate of avidin and ferritin 
15 (Avidin-Ferritin; approx. 1 . 1 mole ferritin per mole avidin; Sigma) by the overnight dialysis 
(4°C) of a solution of avidin-ferritin in PBS (1 mg/ml) against 0.12M thioglycollic acid, pH 
4.25, followed by 24 hours dialysis against PBS (4°C) as described by Kadir and Moore, 
1990. Removal of iron is checked by analysis of the absorbance spectra (sequestered Fe(IH) 
absorbs strongly at 310-360nm). 

20 

0.3 pmole radiolabeled, biotinylated DNA is incubated with varying molar ratios of 
avidin-apoferritin in PBS (total volume 9ul) for 30 minutes at room temperature. A4.5ul 
aliquot is removed and the percentage of DNA complexed with avidin-apoferritin assayed 
using band-shifting assay on a 1.5% agarose gel as described by Berman et al., 1987. The 

25 gel is then dried and scanned using a Phosphorlmager (Molecular Dynamics). The 
percentage of DNA remaining unshifted (i.e. not complexed with avidin-apoferritin) is 17% 
(1: 1 molar ratio avidin-apoferritin: DNA) , 15% (5: 1 molar ratio avidin-apoferritin:DNA) 
or 14% (25:1 molar ratio avidin-apoferritin: DNA). This means that even at a 1:1 ratio of 
avidin-apoferritin: DNA basically all the biotinylated DNA is bound. No band-shifting is 

30 observed when biotinylated DNA is mixed with apoferritin or when non-biotinylated DNA is 
mixed with avidin-apoferritin. 
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The remaining 4.5^1 of DNA complexed with avidin-apoferritin is used as the template for a 
25jil in vitro transcription/translation reaction (E. coli S30 Extract System for Linear 
Templates; Promega). After 2 hours at 25°C, the reaction is stopped by adding 100^1 Buffer 
5 A containing puromycin (125^ig/ml). Dihydrofolate reductase activity is assayed as above by 
spectrophotometrically monitoring the oxidation of NADPH to NADP at 340nm over a 10 
minute time course. 

lOjxl of each in vitro translation reaction is added to 150fil Buffer A and 20^1 NADPH 
10 (ImM). 20jil Dihydrofolate (ImM) (Emulsions were broken and reactions were stopped 
with 0.5 ml EDTA (50 mM) and 2 ml water-saturated diethyl ether as described in 
Example 1) is added after 1 minute and the reaction monitored at 340nm using a ThermoMax 
microplate reader (Molecular Devices). No difference in DHFR activity is found at even the 
highest ratio avidin-apoferritin:DNA compared to a control with no avidin-apoferritin added. 
15 This indicates that the vast majority of DNA can be complexed without compromising the 
efficiency of in vitro translation. 

Example 6. 

Both in vitro transcription-translation and DHFR activity are compatible in the same 
20 system* 

In order to select for the activity of DHFR produced in situ by coupled 
transcription-translation both the transcription-translation reaction and DHFR must be active 
in the same buffer system. 

25 

A direct assay for DHFR activity in a complete E. coli in vitro translation system based on 
the spectrophotometrically monitoring of the oxidation of NADPH to NADP at 340nm is not 
practical due to the turbidity of the S30 extracts. 



30 



However, it is possible to ascertain that DHFR is active in the same buffer system as in vitro 
translation. E. coli DHFR is obtained by DPTG-induction of bacteria containing the plasmid 
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PGEM-/0/A and affinity-purified on a methotrexate-Sepharose column (Baccanari et at., 
1977). 



DHFR activity is compared in Buffer A as above or in an in vitro translation mixture 
5 complete except for the substitution of S30 dialysis buffer (Lesley 1995) (lOmM Tris-acetate 
pH8.0, 14mM magnesium acetate, 60mM potassium acetate, ImM DTT) for the S30 fraction. 
In each case the total reaction volume is 200ul and the concentration of NADPH and 
Emulsions were broken and reactions were stopped with 0.5 ml EDTA (50 mM) and 2 ml 
water-saturated diethyl ether as described in Example 1 each O.lmM. Reactions are 
10 monitored spectrophotometrically at 340nm. Addition of 1.75pmole (1.3mUnits) E. coli 
DHFR gives initial rates of -25.77 mOD/min (in Buffer A) and -11.24 mOD/min (in 
translation buffer), hence the reaction is 44% as efficient in the translation buffer as in an 
optimised buffer (buffer A). 

15 Furthermore, the presence of the substrates of DHFR (NADPH and H 2 F) at O.lmM 
concentration (either alone or in combination) does not cause any inhibition of the production 
of active DHFR from a 2 hour coupled transcription-translation reaction. 

Example 7. 

20 The activity of DHFR on a genetic element containing an immobilised dihydrofolate 
substrate leads to the formation of a tetrahydrofolate product linked to nucleic add 
encoding DHFR. 

A peptide is synthesised comprising three glutamic acids linked via their y-caboxylates 
25 (using N-fluorenylmethoxycarbonyl-glutamic acid a-benzyl ester as a starting material) 
with a lysine at the carboxy-terminus and biotin linked to its e-amino group by modifying 
published procedures (Krumdiek et al., 1980). Folic acid is linked at the amino-terminus 
and the benzyl and trifluoroacetamide protective groups removed by alkaline hydrolysis as 
previously described. The peptide is purified by reverse phase HPLC and characterised by 
30 mass and UV spectroscopy. This folic acid peptide is chemically reduced to the 
corresponding dihydrofolic acid peptide (using dithionate and ascorbic acid) and then to the 
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corresponding tetrahydrofolic acid peptide (using sodium borohydride) by applying 
published procedures (Zakrzewski et al. , 1980). These transformations are characterised by 
UV spectroscopy. 

5 A genetic element is constructed by linking, on average, two to three molecules of the folic 
acid peptide to avidin (or streptavidin) together with one molecule of the DHFR encoding, 
PCR-amplified DNA from the plasmid pGEM-/o/A using primers LMB2-Biotin (SEQ. ID. 
No. 9) and LMB3 (see Example 3). The immobilised folic acid is chemically reduced to 
dihydrofolate using dithionate and ascorbic acid and purified by dialysis against buffer A. 
10 E. coli DHFR is obtained by IPTG induction of bacteria containing the plasmid pGEM- 
folA and affinity purified on a methotrexate-Sepharose column. E. coli DHFR is shown to 
react with the dihydrofolic acid immobilised to this genetic element by monitoring the 
oxidation of NADPH to NADP spectrophotometrically using 0-10 ftM of the avidin-linked 
dihydrofolic acid peptide and 0-50 M M NADPH. Hence, at the end of this reaction, the 
15 product tetrahydrofolate is linked to the/o/A gene which encodes for the enzyme (i.e., 
DHFR) that catalyses its formation. 

i 

To isolate those genes attached to the tetrahydrofolate product there are two approaches. 
The first involves the generation of phage-display antibodies specific for tetrahydrofolate 

20 (Hoogenboom, 1997). The second approach is based on the use of a tagged reagent which 
reacts specifically with the immobilised product, but not with the substrate. We have 
synthesised a molecule consisting of a dinitrophenyl (DNP) tag linked to benzaldehyde via 
a 14 atom spacer. The aldehyde group reacts specifically with tetrahydrofolate to form a 
covalent adduct (Kallen and Jencks, 1966) and affinity purification can be performed using 

25 an anti-DNP antibody. 

Example 8. 

An alternative method of selecting for DHFR activity 



30 



The DHFR-catalysed reaction can be selected for by in situ coupling to a second reaction, 
catalysed by Yeast Aldehyde Dehydrogenase, with a 'tagged' substrate. 
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Instead of selecting for genes connected to one of the products of the DHFR reaction 
(5,6,7,8-tetrahydrofolate or NADP+) the DHFR reaction is coupled to a second reaction. 
Selection is in this case is mediated by the formation of the second product of the 
5 DHFR-catalysed reaction - nicotinamide adenine dinucleotide phosphate (NADP+). 

The reaction we have chosen to couple is catalysed by Yeast Aldehyde Dehydrogenase 
(YAD; EC 1.2.1.5). This enzyme uses either NAD+ or NADP+ in the oxidation of a wide 
range of aliphatic and aromatic aldehydes to their corresponding carboxylic acids, 

10 generating NADH or NADPH in the process. The reaction has the big advantage of being 
essentially irreversible - namely, dehydrogenases (including DHFR and YAD) do not 
catalyse the reduction of the acid back to the aldehyde. Since a large number of enzymes 
catalysing redox reactions generate NAD+ or NADP+ the YAD reaction can be used in 
the selection of these enzymes too, and is not limited solely to selection for Dihydrofolate 

15 Reductase. 

A pentaldehyde substrate is synthesised and linked to a DNP (dinitrophenyl) tag via a C 20 
linker (hereafter, DNP-PA). The oxidation of DNP-PA to the corresponding acid 
(DNP-PC) is followed and by HPLC (reverse phase C 18 column; H2O/CH3CN gradient + 
20 0.1% trifluoroacetic acid; retention times: DNP-PA, 5.0 mins; DNP-PC, 4.26 mins). 
Conversion of DNP-PA to DNP-PC is observed only in the presence of both YAD and 
NADP+. Reactions are also followed spectrophotometrically; the increase of absorbance at 
340nm indicated that NADP+ is simultaneously converted to NADPH. 

25 The coupled DHFR- YAD reaction is followed using the same HPLC assay. The initial 
reaction mixture contained the substrates for DHFR - NADPH (50 nM) and 
7-8-dihydrofolate (H 2 F; 50 \xM) 9 YAD (Sigma, 0.5 unit) and DNP-PA (50 \*M) in buffer 
pH 7.7 (100 mM imidazole, 5 mM p-mercaptoethanol and 25 mM KC1). Conversion of 
DNP-PA to DNP-PC is observed when DHFR is added to the above reaction mixture 

30 (DHFR 5 nM, 83 %.; 1.25 nM, 14.5 % after 32 mins). 
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The concentration of DHFR obtained in the compartmentalised in vitro translation is in fact 
much higher than 5 nM (see Example 4). The conversion of DNP-PA to DNP-PC is 
negligible in the absence of DHFR, or when methotrexate (MTX) - a potent inhibitor of 
the enzyme - is present (10 uM). Hence, the formation of the secondary product, DNP-PC, 
is therefore linked to the presence of the DHFR. 

Using this coupled reaction, proteins conferring DHFR activity can be selected by: i) 
linking the genes to antibodies that specifically bind the carboxylic product of DNP-PA, 
and ii) isolating these genes by affinity purification using an anti-DNP antibody. 

This approach is demonstrated by a routine immuno assay based on the catELISA (Tawfik 
et al., 1993). Microtiter plates are coated with anti-rabbit immunoglobulins (Sigma, 10 
ug/well) followed by rabbit polyclonal serum that specifically bind glutaric acid derivatives 
(Tawfik et al., 1993) diluted 1:500 in phosphate saline buffer + 1 mg/ml BSA). The plates 
are rinsed and blocked with BSA. The coupled reaction mixtures described above are 
diluted in Tris/BSA buffer (50 mM Tris, 150 mM sodium chloride, 10 mg/ml BSA, pH 
7.4) and incubated for 1 nr. The plate is rinsed and an anti-DNP antibody (mouse 
monoclonal SPE21.11) diluted in the same buffer (1: 10,000) is added and incubated for 
an hour. The plate is rinsed and peroxidase labelled anti mouse antibody (Jackson) is added 
followed by a peroxidase substrate (BM Blue; Boehringer Mannheim). A specific signal is 
observed only in the coupled reactions samples that contained DHFR (in addition to4l2F, 
NADPH, YAD and DNP-PA). 

Highly specific anti-carboxylic acid antibodies (Tawfik et al., 1993) are used for selection 
in two formats. 

In the first, the anti-carboxylic acid antibody is coupled chemically to a high molecular 
weight avidin (or streptavidin) containing complex such as that described in Example 5. 
Biotinylated DNA encoding DHFR is coupled to this complex via the avidin-biotin 
interaction as described in Example 5. This complex is then used in a compartmentalised 
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coupled transcription/translation system which also contains YAD and a tagged YAD 
substrate such as DNP-PA. If there is DHFR activity in the compartment the DNP-PA is 
converted to DNP-PC. The anti-carboxylic acid antibodies, coupled to the DNA via the 
high molecular weight complex will capture only DNP-PC molecules and not aldehyde 
5 molecules. DNA from those compartments containing active DHFR (and hence encoding 
active DHFR if there is only one molecule of DNA per compartment) are then affinity 
purified by using anti-DNP antibodies. 

In the second format, multiple streptavidin molecules are coupled together in a high 
10 molecular weight complex which can easily be coupled to biotinylated DNA encoding 
DHFR (see Example 5). This complex is used in a compartmentalised coupled 
transcription/translation system which also contains YAD and a YAD substrate such as 
MeNPOC-biotin-benzaldehyde. The biotin group in MeNPOC-biotin-benzaldehyde is 
"caged" (Sundberg et at., 1995; Pirrung and Huang, 1996), that is, it cannot be bound by 
15 avidin or streptavidin until a photoremovable nitrobenzyl group has been cleaved off by 
irradiation with light. If there is DHFR activity in the compartment the 
MeNPOC-biotin-benzaldehyde is converted to MeNPOC-biotin-benzoic acid. After the 
compartmentalised reaction has run for a while the reaction is irradiated with light and the 
nitrobenzyl group removed and the compound will bind to the streptavidin-DNA complex. 
20 DNA in those compartments containing active DHFR (and hence encoding active DHFR if 
there is only one molecule of DNA per compartment) is complexed with biotin-benzoic 
acid (instead of biotin-benzaldehyde) and can be affinity purified using immobilised 
anti-benzoic acid antibodies. 

25 The presence of other enzymes which can catalyse the oxidation NAD+ or NADP+ to 
NADH or NADPH in the in vitro transcription/translation system can under certain 
circumstances make it difficult to use this YAD system for selection directly in the 
compartmentalised in vitro transcription/translation system. In this case the selection is 
carried out using the two-step compartmentalisation system described earlier. That is, the 

30 DHFR is first translated in compartments and then linked to the DNA in the same 
compartment by means of a suitable affinity tag. The emulsion is broken, the contents of 
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the compartments pooled and the DNA affinity purified away from the other components of 
the transcription/translation system (including contaminating oxido-reductases), by using 
antibodies specific to a digoxigenin 'tag' attached to one end of the DNA molecule. The 
purified DNA molecules, together with the attached DHFR protein are then put into a 
5 reaction mixture contained the substrates for DHFR - NADPH (50 ^M) and 
7-8-dihydrofolate (H 2 F; 50 ^iM), YAD (Sigma, 0.5 unit) and DNP-PA (50 nM) in buffer 
pH 7.7 (100 mM imidazole, 5 mM P-mercaptoethanol and 25 mM KC1) and the reaction 
re-compartmentalised by emulsification to give only one, or at most a few, molecules of 
DNA per compartment. Anti-carboxylic acid antibodies (Tawfik et al., 1993) are used for 
10 selection in either of the two formats described above. 



Example 9. 

Methylation of genetic elements by gene products 



15 



DNA methyltransferases, produced by in vitro transcription/translation in the aqueous 
compartments of a water-in-oil emulsion, methylate the DNA molecules which encode 
them in the compartments. 

20 Selecting proteins with binding or catalytic activities using the compartmentalisation system 
described here presents two basic requirements: i) a single molecule of DNA (or at most a 
few molecules) encoding the proteins to be selected is expressed in a biologically active 
form by a coupled transcription/translation system in the aqueous compartments of a 
water-in-oil emulsion; and, ii) the protein to be selected must be able to modify the genetic 

25 element that encoded it in such a way as to make it selectable in a subsequent step. In this 
Example, we describe a group of proteins - DNA methyl transferases (type II) - that are 
produced efficiently in the aqueous compartments of a water-in-oil emulsion system using a 
coupled transcription/translation system. Furthermore, the in vitro translated DNA 
methyltransferases efficiently modify the DNA molecules which encode them in situ in the 

30 aqueous compartments so that they can be selected and amplified. The target sites on the 
DNA molecules are modified by methylation of a cytosine at the C5 position which renders 
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the sites resistant to cleavage by the cognate restriction endonuclease (i.e. Hhal for 
U.Hhal, and Haem for M.HaeW). Hence, methylated DNA is selectable over 
non-methylated DNA by virtue of its resistance to restriction endonuclease cleavage. 

The gene encoding U.Hhal is amplified by PCR using oligonucleotides HhaI-Fo2S and 
Hhal-Bc direcdy from Haemophilus parahaemolyticus (ATCC 10014). The gene encoding 
U.Haem is amplified by PCR using oligonucleotides HaeIII-Fo2s and HaeHI-Bc (SEQ. 
ID. No. 4) directly from Haemophilus influenzae (biogroup aegyptius) (ATCC 11116). 
Both PCR fragments are cloned into the vector pGEM-4Z (Promega) digested with Hindm 
and Kpnl downstream of the lac promoter and T7 RNA polymerase promoter. The 
oligonucleotides Hhal-Bc and HaeHI-Bc (SEQ. ID. No. 4) append the efficient phage 17 
gene 10 translational start site upstream of the methyltransferase gene start codon. 
Oligonucleotide Hhal-Fo appends an Hhal methylation/restriction site (M/R) and a HaeW 
Wotl) site to function as substrates for U.Hhal and U.Haem respectively. 
15 Oligonucleotide HaeDI-Fo appends a NotVHaeW M/R site which functions as a substrate 
for U.Haem. (the U.Haem gene already contains two internal Hhal M/R sites). DNA 
sequencing identifies clones with the correct nucleotide sequence. 



10 



20 



The pGEM-M.flfol and pGEU-U.Haem plasmids described above are amplified by PCR 
using primers LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10) as 
above to create either 1167 base pair DIG-M.flfoiI-Biotin or a 1171 base pair 
DIG-M.i/aein-Biotin DNA fragment, labelled at one end by biotin and the other end by 
digoxigenin, and which carry the T7 RNA polymerase promoter, the phage T7 gene 10 
translational start site, the methyltransferase gene and M/R sites of Haem and Hhal. The 
25 PCR fragments are each purified directly using Wizard PCR Preps (Promega). 

The genes required for the coupled in vitro transcription-translation of U.EcoRl and 
U.EcoRV are amplified by PCR using plasmids pMBl (Betlach et al., 1976) and pLBl 
(Bougueleret et al., 1984) respectively, as templates, a back primer appending the phage 
30 T7 gene 10 translational start site and LMB3 upstream of the methyltransferase gene 
ribosome binding site (EcoRI-Bc or EcoRV-Bc) and a forward primer (EcoRI-Fo or EcoRI- 
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Fo) appending LMB2. These fragments are further amplified by PCR using primers 
LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10) as described above to 
create the DIG-M.Eco RI-Biotin and DIG-M.£coRV-Biotin DNA fragments which carry 
the 17 RNA polymerase promoter, the phage T7 gene 10 translational start site, the 
5 methyltransferase gene and M/R sites of EcoRl and EcoRV. These PCR fragments are each 
purified directly using Wizard PCR Preps (Promega). 

The PCR-amplified DNA-methylases genes described above are expressed in a prokaryotic 
in vitro coupled transcription/translation system designed for linear templates (Lesley et 
10 al, 1991). A commercial preparation of this system is used (E. coli S30 Extract System for 
Linear Templates; Promega) supplemented with T7 RNA polymerase and S-adenosyl 
methionine (SAM) at 80 (xM concentration. 

Methylation is assayed by measuring the resistance of DNA fragments labelled with DIG 
15 and biotin to cleavage by the cognate restriction enzyme using the Boehringer-Mannheim 
DIG-Biotin ELISA or with radioactively labelled DNA fragments and streptavidin coated 
magnetic beads. In vitro reaction mixtures containing DIG-Biotin labelled fragments 
reacted in situ by coupled in vitro transcription-translation as described below are diluted in 
lxW&B buffer (1M NaCl, 10 mM Tris, 1 mM EDTA, pH 7.4) +0.1% Tween-20 (the 
20 concentration of the DIG/Biotin labelled DNA in the assay is in the range of 0-250 pM) 
and incubated in streptavidin coated microtiter plates (high capacity) for 30-60 mins. The 
plate is rinsed (3 times 2xW&B and finally with 50 mM Tris pH 7.4 + 5 mM MgCl2) and 
the restriction enzymes (NEB) are added (10-50 units enzyme in 0.2 ml of the 
corresponding buffer) and incubated at 37° for 3-12 hrs. The plate is rinsed and 
25 peroxidase-linked anti-DIG antibodies (diluted 1:1,500 in PBS + 0.1% Tween-20 + 2 
mg/ml BSA) are added for 40-60 min followed by the peroxidase substrate (BM Blue; 70 
Ml/well). The absorbance (at 450 minus 650nm) is measured after quenching with 0.5M 
H2SO4 (130 /tl/well). 



30 For the radioactive assay, the plasmids and PCR fragments described above are amplified 
by PCR using primers LMB2-Biotin (SEQ. ED. No. 9) and LMB3 and a-P 32 -CTP to give 
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P32-labelled DNA fragments labelled at one end by biotin and which carry the T7 RNA 
polymerase promoter, the phage 17 gene 10 translation^ start site, the methyltransferase 
gene and the relevant M/R sites. These PCR fragments are purified directly using Wizard 
PCR Preps (Promega). Reaction mixtures containing the Biotin-P32-labelled DNA reacted 
5 in situ by coupled in vitro transcription-translation are diluted in lxW&B buffer + 0.1% 
Tween-20 and incubated with streptavidin coated magnetic beads (Dynal, M-280; 1-5 xlO 6 
beads) for 30-60 mins. The beads are separated and rinsed (3 times 2xW&B + 0.1% 
Tween-20 + 3% BSA and finally with 50 mM Tris pH 7.4 + 5 mM MgCl2). The 
restriction enzymes (NEB) are added (10-50 units enzyme in 50-150 M l of the 
10 corresponding buffer) and incubated at 37° for 5-20 hrs. The supernatant is removed and 
the beads rinsed and resuspended in 100 /d water. The amount of radioactively-labelled 
DNA on the beads and in the supernatants is determined by scintillation. 

All four methylases described here - M.Haelll, M.Hhal, M.EcoBl and M.EcoRV - are 
15 expressed and active in the in vitro coupled transcription/translation. Furthermore, the in 
vitro translated methylase can methylate its own gene thus rendering it resistant to cleavage 
by the cognate methylase (self-methylation). Both processes, the coupled in vitro 
transcription-translation of the methylase gene as well as its methylation proceed efficiently 
in the same reaction mixture. More specifically, DNA fragments (at 0.5 to 10 nM 
20 concentrations) which carry the T7 RNA polymerase promoter, the phage T7 gene 10 
translational start site, a methyltransferase gene and M/R sites of all four methylases 
become resistant to cleavage by the cognate restriction endonuclease. For example, the 
DNA fragment encoding M.£coRI methyltransferase becomes resistant to cleavage by 
EcoM (75-100% after 20-90 minutes at 25°C) when incubated with E. coli S30 Extract 
25 System for Linear Templates (Promega), SAM (80 pM) and T7 RNA polymerase. The 
resistance to cleavage as a result of methylation is selective and specific: under the same 
conditions, resistance to cleavage by Hhal or M.EcoRV is not observed; moreover, 
resistance to cleavage by £coRI is not observed when translation is inhibited (e.g. in the 
presence of puromycin or in the absence of T7 RNA polymerase). Similar results where 
30 obtained when survival of the genes is assayed by DIG-Biotin EUSA or with 
Biotin-p32-iabelled DNA fragments as described above. Methylation in trans, i.e., of DNA 
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fragments (other than those encoding for the cognate methylase) appending M/R sites is 
also observed in the E. coli S30 coupled in vitro transcription-translation system in the 
presence of a gene encoding for a methylase. 

Both processes, the coupled in vitro transcription-translation of the methylase genes as well 
as their self-methylation proceed efficiently in the aqueous compartments of a water-in-oil 
emulsion. More specifically, DNA fragments (at 0.1-10 nM concentrations) which carry 
the T7 RNA polymerase promoter, the phage T7 gene 10 translational start site, the 
methyltransferase gene (for example, M.Hhal) and the M/R sites of HaeUl, Hhal and 
EcoRl are added to £. coli S30 Extract System for Linear Templates (Promega) in the 
presence of SAM (80 fM) and T7 RNA polymerase. The ice cooled reaction mixtures are 
emulsified by homogenising for 1 minute with an Ultra-Turrax T25 disperser at 20,000 
rpm as described in Example 1 and incubated at 25°-30° for 0-180 mins. The reaction is 
stopped and the aqueous phase is separated (see Example 1) and the methylation of the 
DIG-Biotin or Biotin-P 32 -labelled DNA fragments is assayed as described above. 
Methylation of up to 20% of the compartmentalised genes to cleavage by Hhal is observed 
after 60-180 mins incubation. No resistance is observed when the ice-cold emulsion is 
broken just after it is made and the reaction quenched by ether extraction ('0 mins 1 ). The 
methylation is selective: under the same conditions, resistance to cleavage by HaeUl or 
EcoRI is not observed. Moreover, the assay of P 32 -labelled DNA fragments has shown that 
self-methylation of both M.HaeUl and M.Hhal proceed at concentrations of genes that 
correspond to an average of less than one gene per compartment (0. 1-0.5 nM; see Example 
4). Thus, the coupled in vitro transcription-translation of the methylases genes as well as 
their self-methylation proceed efficiently even when only a single genetic element is present 
in aqueous compartments of the water-in-oil emulsion. 

Haelll methylase activity resulting from coupled transcription/translation of M.Haelll 
genes is also measured in the larger microcapsules produced by stirring alone, and by 
stirring followed by further homogenisation with an Ultra-Turrax T25 disperser at 8,000 
rpm, 9,000 rpm, or 13,500 rpm for 1 minute as described in Example 1. The results are 
presented in Figure 2b. The concentration of M.HaeUl genes used (2.5 nM) gives an 
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average of I, 1.5 and 4.8 genetic elements per droplet in the emulsions homogenised at 
13,500 rpm, 9,500 rpm and 8,000 rpm, respectively, and an average of 14 genetic 
elements per droplet in the emulsion prepared by stirring only. The addition of an anionic 
surfactant - e.g., sodium deoxycholate, typically to 0.5% (w/v), to the in vitro translation 
5 mixture significantly increases the percentage of genetic elements methylated in the 
emulsions. 

Example 10, 

Genetic elements encoding DNA methyltransferases can be selected and amplified 
10 following their self-methylation in the aqueous compartments of a water-in-oil 
emulsion. 

The methylation of genes encoding for DNA-methylases allows them to be isolated and 
amplified in a subsequent step. The methylated DNA is selectable over non-methylated 

15 DNA by virtue of its resistance to restriction endonuclease cleavage. Thus, the genetic 
elements that remain intact after treatment with the cognate restriction enzyme can be 
amplified by PCR. However, such a selection is obviously unattainable if other genes that 
contain the same R/M site but do not encode for the methylase are present in same reaction 
mixture. This is because cross-methylation of the non-methylase genes (that are present at a 

20 large excess) will render them resistant to cleavage by the cognate restriction enzyme and 
thus amplifiable by PCR. Under these conditions, selection of genes encoding the 
methylase will become possible only if they are compartmentalised - namely, if only one, 
or few genes are present in a single compartment so that self methylation is the major 
process in that compartment. Cross-methylation is avoided since non-methylase genes that 

25 are present in compartments that do not contain a methylase gene will remain 
un-methylated. 

The genes used in the experiment are a 1194 base pair M.HaelYL fragment (DIG-M.//adII- 
3s-Biotin) encoding methylase Haelll and a 681 base pair/o/A fragment (DIG-/o/A-3s-Biotin) 
30 encoding the enzyme dihydrofolate reductase (DHFR) containing additional HaeUi and Hhal 
restriction/modification sites (See Fig. lb). Both DNA fragments are labelled at one end with 
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digoxigenin (DIG) and the other with biotin, and contain a T7 RNA polymerase promoter 
(T7 Promoter) and T7 gene 10 translational initiation site (rbs) for expression in vitro. 

pGEM-4Z-3s is created by annealing oligonucleotides HaeHha-Pl and HaeHha-Mi (SEQ. ID. 
5 No. 2) (Table 1) and ligating them into Hindm and EcoM cut pGEM-4Z (Promega). The 
M. Haelll gene is amplified by PCR from Haemophilus influenzae (biogroup aegyptius) 
(ATCC 11116) using oligonucleotides Haeffl-FoNC (SEQ. ID. No. 3) and Haelll-Bc (SEQ. 
ID. No. 4) (Table 1). The folk gene is amplified from Escherichia coli using primers 
EDHFR-Fo (SEQ. ID. No. 5) and EDHFR-Ba (SEQ. ID. No. 6) (Table 1). Both amplified 

10 genes are digested with Hindlll and Kpnl and cloned into pGEM-4Z-3s, creating the 
expression vectors pGEM-/faeffl-3s and pGEM-/<?/A-3s. DIG-M.#aeIII-3s-Biotin and DIG- 
/o/A-3s-Biotin (see Fig. lb) are amplified from these vectors by PCR with Pfu polymerase 
using primers LMB2-Biotin (SEQ. ID. No. 9) and LMB3-DIG (SEQ. ID. NO. 10) (20 
cycles) and purified using Wizard PCR Preps (Promega). DIG-D1.3-Biotin, a 942 bp DNA 

15 fragment containing four Haelll R/M sites used as a substrate to assay HaeUl methylase 
activity, is amplified from a pUC19 derivative containing a D1.3 single-chain Fv gene 
(McCafferty et al. , 1990) as above. A 558 bp carrier DNA (g3 carrier DNA; an internal 
fragment of phage fd gene III which has no T7 promoter, Haelll or Hhal R/M sites) is 
amplified by PCR with Taq polymerase from pHENl DNA (Hoogenboom et al , 1991) using 

20 primers G3FRAG-Fo (SEQ. ID. No. 11) and G3FRAG-Ba (SEQ. ID. No. 12) (Table 1) and 
purified by phenol-chloroform extraction and ethanol precipitation. This DNA (at ^10 nM) 
was used as a carrier in dilution of all DNA used for the reactions in this example. 



25 Figure 3 demonstrates the selection of M. Haelll genes encoding the DNA methylase Haelll 
from an excess of folk genes (encoding DHFR which does not methylate DNA). Both 
genes have the same Haelll R/M sequences appended to act as a substrate (Fig. lb). After 
translation in the aqueous compartments of an emulsion the Haelll R/M sequences attached 
to methylase genes are methylated. These genes are rendered resistant to cleavage by 

30 Haelll endonuclease and are subsequendy amplified by PCR. folk genes, present in other 
compartments, remain unmethylated, are cleaved and not amplified. The PCR products are 
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analysed by agarose gel electrophoresis where enrichment for the M.Haelll genes can be 
visualised by the appearance of a 1194 bp band (Fig. 3). 



The E, coli S30 extract system for linear DNA (Promega) is used, supplemented with g3 
5 carrier DNA (10 nM), DNA fragments (DIG-M.//adII-3s-Biotin and DIG-M./o/A-3s- 
Biotin at ratios and concentrations indicated below), T7 RNA polymerase (10 4 units), 
sodium deoxycholate (Fluka, 0.5% w/v; in emulsified reactions only) and S-adenosyl 
methionine (NEB, 80 \iM). Reactions are set up using DNA fragments DIG-M.//fleIII-3s- 
Biotin and DIG-M./tf/A-3s-Biotin at a ratio of 1:10 3 and at a total concentration of 200 pM 

10 (Fig. 3a) and ratios of 1.10 4 to 1:10 7 and a total concentration of 500 pM (Fig. 3b). Fifty 
microliter reactions are prepared on ice and emulsified by stirring only as described in 
Example 1. Reactions are incubated for 2 hours at 25 °C. To recover the reaction mixtures, 
the emulsions are spun at 3,000 g for 5 minutes and the oil phase removed leaving the 
concentrated emulsion at the bottom of the vial. Quenching buffer (0.2 ml of 25 jxg/ml 

15 yeast RNA in W+B buffer: 1 M NaCl, 10 mM Tris-HCl, 1 mM EDTA pH 7.4) and 2 ml 
of water-saturated ether are added and the mixture is vortexed, centrifuged briefly, and the 
ether phase removed. The aqueous phase is washed with ether and dried (5 minutes in a 
Speedvac at ambient temperature). DNA is captured on 100 fig M-280 streptavidin 
Dynabeads (2 hours at ambient temperature). The Dynabeads are washed sequentially with: 

20 W+B buffer; 2.25 M Guanidine-HCl, 25 mM Tris-HCl, pH 7.4; W+B buffer; and, 
twice with restriction buffer. Beads are resuspended in 100 /d restriction buffer containing 
10 units Haelll (or Hhal) and incubated at 37°C for 5 hours. The beads are washed three 
times with W+B buffer, twice with 50 mM KC1, 10 mM Tris-HCl, 0.1% Triton X-100, 
pH 9.0, and then resuspended in 100 jil of the same buffer supplemented with 1.5 mM 

25 MgCl 2 (PCR buffer). Aliquots of beads (2-20 jd) are amplified by PCR using Taq 
polymerase added at 94°C with primers LMB2-Biotin and LMB3-DIG (50 fil reactions; 32 
cycles of 1 minute at 94°, 1 minute at 55°, 2 minutes at 72°). This DNA is purified using 
Wizard PCR Preps and used for the second round of selection (20 pM in the 1:10* and 
1:10 5 selections and 500 pM in the 1:10 s and 1:10 7 selections). For gel electrophoresis and 

30 activity assays this DNA (diluted to ~ IpM) is further amplified with primers LMB2-Nest 
and LMB3-Nest which anneal immediately inside LMB2 and LMB3 respectively (25 cycles 
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of 1 minute at 94°, 1 minute at 50°, 1.5 minutes at 72°) and purified as above. This DNA 
(at 10 nM), which has neither DIG nor Biotin appended, is also translated in vitro in the 
presence of 10 nM DIG-D1.3-Biotin, a 942 bp DNA containing four Haelll R/M sites. 
Methylation of the DIG-D1.3-Biotin substrate is determined by DIG-Biotin ELISA as 
5 Example 9. 

A single round of selection of a 1:1000 ratio of M.Haelil : folA genes in the emulsion results 
in a roughly 1:1 final gene ratio (Fig. 3a). Several control experiments indicate that selection 
proceeds according to the mechanism described above: a band corresponding to the M. Haelll 

10 gene is not observed when the initial mixture of genes is amplified by PCR; nor after reaction 
in solution (non-emulsified); nor when emulsified in the absence of transcription/translation 
(when T7 RNA polymerase is omitted); nor when the reacted genes are cleaved at R/M sites 
other than those of Haelll - e.g., after digestion with Hhal. The yield of M. Haelll DNA 
after selection is less than 100% primarily due to incomplete digestion by Haelll rather than 

15 cross-methylation as indicated by the large folA band observed in the absence of methylase 
activity (when T7 polymerase is not added). During digestion, the concentration of DNA 
drops well below the K M of Haelll (6 nM) and digestion becomes extremely inefficient. 

A band corresponding to M.Haelil genes also becomes visible after a single-round of 
20 selection starting from M.Haelil: folA ratios of 1:10 4 to 1:10 5 (Fig. 3b), but not at lower 
ratios, indicating an enrichment factor of at least 5000-fold. Selection of a small number of 
genes from a large pool (e.g., a gene library) therefore requires further rounds of selection. 
When the /foelll-digested and amplified DNA from the first round of selection is subjected to 
a second round of selection, a band corresponding to M.Haelil genes also became visible 
25 from 1:10* and 1:10 7 starting ratios of M.Haelil: folA. A second round of selection is also 
performed on the DNA derived from the 1:10 4 to 1:10 5 starting ratios of M.Haelil: folA. 
This gives a further enrichment, up to a ratio of approximately 3:1 in favour of the M. Haelll 
genes. Before and after each round of selection the genes are amplified, translated in vitro 
and reacted with a separate DNA substrate to assay for Haelll methylase activity. These 
30 assays indicate that enrichment for the M.Haelil genes as observed by gel electrophoresis 
results in a parallel increase in Haelll methylase activity (Fig. 3b). 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MEDICAL RESEARCH COUNCIL 

(B) STREET: 20 PARK CRESCENT 

(C) CITY: LONDON 

(E) COUNTRY: UK 

(F) POSTAL CODE (ZIP) : WIN 4AL 

(ii) TITLE OF INVENTION: IN VITRO SORTING METHOD 
(iii) NUMBER OF SEQUENCES: 23 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc m "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AGCTTGCATG CCTGCGGTAC CGGCCATGCG CATGGCCTAG CGCATGCGGC CGCTAGCGCG 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AATTCGCGCT AGCGGCCGCA TGCGCTAGGC CATGCGCATG GCCGGTACCG CAGGCATGCA 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION; /desc - "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CGAGCTAGAG GTACCTTATT AATTACCTTT ACAAATTTCC AATGCAGATT TTAT 54 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCATCTGACA AGCTTAATAA TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGAATTTA 60 
ATTAGTCTTT TTTCAGGTGC AGGG 84 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGAGCTAGAG GTACCTTATT ACCGCCGCTC CAGAATCTCA AAGCAATAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE 11 

(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCATCTGACA AGCTTAATAA TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGATCAGT 
CTGATTGCGG CGTTAGCGGT AG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAATTGGATT TAGGTGAC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



10 



15 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CATGATTACG CCAAGCTC 
(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
20 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 



18 



25 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 10: 



17 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
{B) TYPE : nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE 11 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTCTCTGAAT TTACCGTTCC AG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAAACTGTTG AAAGTTGTTT AG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATTATAATAC GACTCACTAT AGGGAGAGTT ATCAGGCATG CACC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CTAGCTCCCA TTAAGGAG 

(2) INFORMATION FOR S£Q ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15- 

15 

GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE n 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAGGAAACAG CTATGAC 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

1 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGAGCTAGAG GTACCGCGGC CGCTGCGCTT ATTAATATGG TTTGAAATTT AATGATGAAC 60 
CAATG 

65 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 



(iii) HYPOTHETICAL: NO 
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<iv) ANTI -SENSE: NO 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCATCTGACA AGCTTAATAA TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGATTGAA < 
ATAAAAGATA AACAGCTCAC AGG { 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGAGCTAGAG GTACCGCGGC CGCTGCGCTT ATTAATTACC TTTACAAATT TCCAATGCAG 60 
ATTTTAT 67 



(2) INFORMATION FOR SEQ ID NO: 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CAGGAAACAQ CTATGACAAG CTTAATACGA CTCACTATAG GGAGATATTT TTTATTTTAA 60 
TAAGGTTTTA ATTAATGG 

78 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE : NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTAAAACGAC GGCCAGTGAA TTCTTATTAC TTTTGTAATC GTTTGTTTTT TATC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYt>E: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CAGGAAACAG CTATGACAAG CTTAATACGA CTCACTATAG GGAGAAATGG GTTTCTTTGG 
CATATTTTTT ACAAATG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC OLIGONUCLEOTIDE " 

(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTAAAACGAC GGCCAGTGAA TTCGATATCT TATTACTCTT CAATTACCAA AATATCCCC 
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CLAIMS 



0 



1. A method for isolaUng one or more genetic elements encoding a gene product 
having a desired activity, comprising the steps of: 

(a) compartmentalising the genetic elements into microcapsules; 

(b) expressing the genetic elements to produce their respective gene products 
within the microcapsules; 

(c) sorting the genetic elements which produce gene produces) having the 
desired activity. 

2. A method according to claim 1 wherein a gene product with the desired activity 
induces a change in the microcapsule which enables the microcapsule containing the 
gene product and the genetic element encoding it to be sorted. 

3. A method according to claim 2 wherein a gene product having the desired 
activity modifies one or more molecules within the microcapsule which enables the 
microcapsule containing the gene product and the genetic element encoding it to be 
sorted. 



4. A method according to claim 2 or claim 3 wherein the gene product having the 
desired activity induces a change in the optical properties of the microcapsule and the 
microcapsules are sorted thereby. 

5. A method according to claim 1, wherein in step (b) the activity of the desired 
gene product within the microcapsule results, directly or indirectly, in the modification 
of the genetic element encoding the gene product to enable the isolation of the genetic 
element. 

6. A method according to claim 5 wherein the gene product having the desired 
activity induces a change in the microcapsule which, when detected, triggers the 



5 
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modification of the genetic element within the compartment to enable the isolation of 
the genetic element. 

7. A method according to claim 6 wherein the change in the microcapsule is a 
change in the optical properties thereof. 

8. A method according to claim 5 wherein a part of the genetic element is a 
substrate and the activity of the desired gene product within the microcapsule results, 
direcdy or indirectly, in the conversion of said substrate into a product which remains 

10 pan of the genetic element and enables its isolation. 

9. A method according to claim 5 wherein a part of the genetic element is a ligand 
and the desired gene product within the microcapsule binds, directly or indirectly, to 
said ligand to enable the isolation of the genetic element. 



15 



20 



10. A method according to any preceding claim wherein the activity of the desired 
gene product within the microcapsule results, direcdy or indirecdy, in the alteration of 
the expression of a second gene within the compartment and the activity of the product 
of the said second gene enables the isolation of the genetic element. 

11. A method according to claim 1 , wherein step (b) comprises: 

expressing the genetic elements to produce their respective gene products within 

the microcapsules, linking the gene products to the genetic elements encoding them and 
isolating the complexes thereby formed. 

12. A method according to claim 11, wherein in step (c) the complexes are direcdy 
sorted to isolate genetic elements encoding a gene product having the desired activity. 

13. A method according to claim 12, wherein the complexes are subjected to a 
30 further compartmentalisation step in order to isolate the genetic elements encoding a 

gene product having the desired activity. 



25 
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14. A method according to claim 13 wherein the microcapsules are sorted according 
to claims 2 to 4. 



5 15. A method according to claim 12 wherein the genetic elements are sorted 
according to claims 5 to 10. 

16. A method according to any preceding claim, wherein the genetic elements are 
isolated from a library of genetic elements encoding a repertoire of gene products. 

10 

17. A method according to any preceding claim further comprising the additional 
step of: 

(d) introducing one or more mutations into the genetic elements) isolated in step 

(c). 

15 

18. A method according to any preceding claim further comprising iteratively 
repeating one or more of steps (a) to (d). 

19. A method according to any preceding claim further comprising amplifying the 
genetic elements. 

20. A method for compartmentalising a genetic element and expressing the genetic 
element to form its gene product within the compartment, comprising the steps of: 

(a) forming an aqueous solution comprising the genetic element and the 
components necessary to express it to form its gene product; 

(b) microencapsulating the solution so as to form a discrete microcapsule 
comprising the genetic element; and 

(c) exposing the microcapsule to conditions suitable for the expression of the 
genetic element to form its gene product to proceed. 
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21 . A method according to claim 20, wherein a library of genetic elements encoding 
a repertoire of gene products is encapsulated and each genetic element expressed to 
form its gene product within a microcapsule. 

5 22. A method according to claim 20 or claim 21, wherein microencapsulation is 
achieved by forming a water-in-oil emulsion of the aqueous solution in an oil-based 
medium. 



23. A microcapsule obtainable by the method of any one of claims 20 to 22. 

0 

24. A method according to any one of claims 5 to 19, wherein the genetic elements 
are sorted by affinity purification. 

25. A method according to any one of claims 5 to 19, wherein the genetic elements 
! are sorted by selective ablation of genetic elements which do not encode the desired 

gene product. 



26. A product when isolated according to the method of any preceding claim. 

27. A method for preparing a gene product, comprising the steps of: 

(a) preparing a genetic element encoding the gene product; 

(b) compartmentalising genetic elements into microcapsules; 

(c) expressing the genetic elements to produce their respective gene products 
within the microcapsules; 

(d) sorting the genetic elements which produce the gene produces) having the 
desired activity; and 

(e) expressing the gene product having the desired activity. 

28. A method for screening a compound or compounds capable of modulating the 
activity of a gene product, comprising the steps of: 

(a) preparing a repertoire of genetic elements encoding gene product; 
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(b) compartmentalising the genetic elements into microcapsules; 

(c) expressing the genetic elements to produce their respective gene products 
within the microcapsules; 

(d) sorting the genetic elements which produce the gene produces) having the 
desired activity; and 

(e) contacting a gene product having the desired activity with the compound or 
compounds and monitoring the modulation of an activity of the gene product by the 
compound or compounds. 

29. A method for preparing a compound or compounds comprising the steps of: 

(a) providing a synthesis protocol wherein at least one step is facilitated by a 
polypeptide; 

(b) preparing genetic elements encoding variants of the polypeptide which 
facilitates this step; 

(c) compartmentalising the genetic elements into microcapsules; 

(d) expressing the genetic elements to produce their respective gene products 
within the microcapsules; 

(e) sorting the genetic elements which produce polypeptide gene produces) 
having the desired activity; and 

(0 preparing the compound or compounds using the polypeptide gene product 
identified in (e) to facilitate the relevant step of the synthesis. 
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